Abstract

We consider a resource management problem in a multi-cell downlink OFDMA network whereby
the goal is to find the optimal combination of (i) the assignment of users to base stations and
(ii) the resource allocation strategies at each base station. Efficient resource management protocols
must rely on users truthfully reporting privately held information such as downlink channel states.
However, individual users can manipulate the resulting resource allocation (by misreporting their
private information) if doing so improves their payoff. Therefore, it is of interest to design efficient
resource management protocols that are strategy-proof, i.e., it is in the users' best interests to
truthfully report their private information. Unfortunately, we show that the implementation of any
protocol that is efficient and strategy-proof is NP-hard. Thus, we propose a computationally tractable
strategy-proof mechanism that is approximately efficient, i.e., the solution obtained yields at least
1/2 of the optimal throughput. Simulations are provided to illustrate the effectiveness of the
proposed mechanism.

Index Terms

Heterogeneous Network, Mechanism Design, Resource Allocation, Base Station Association,
Approximation Bounds, Computational Complexity, Nash Equilibrium, Price of Anarchy

I. Introduction



We consider a downlink OFDMA network with multiple base stations (BSs) serving a group of
users. The BSs operate on non-overlapping spectrum bands in frequency division duplex (FDD) mode.
The objective is to find the best per-BS resource allocation strategy and the user-BS assignment to
achieve spectral efficiency and load balancing across the networks. This problem is well motivated by
many practical networks such as the multi-technology heterogeneous networks (HetNet) [1], [2], the
IEEE 802.22 Wireless Regional Area Network (WRAN) [3], or a Wi-Fi network with multiple access
points [4]. For example, in the HetNet, multiple wireless access technologies such as Wi-Fi, LTE or
WiMAX are available for the same region. These networks operate on different spectrum bands and
all utilize OFDMA for downlink transmission. Integrating these different radio access technologies and
making them available to the user devices can significantly increase the overall spectral efficiency as
well as achieve load balancing across different networks. The mobile users can choose one of
the technologies/networks for association, and they can switch between different technologies/networks
to avoid congestion (i.e., the "vertical handoff" operation; see [?]). The user-network assignment and the
per-network resource allocation need to be performed jointly to achieve optimal network-wide resource
allocation.

There are three major challenges for optimal resource allocation in such networks. 
1) When operating in FDD mode, the network requires the users to measure and report the downlink
channel states for efficient resource allocation. An untruthful user may incorrectly report this information
for its own benefit. Various forms of untruthful user behavior in wireless networks have recently been
noted (see, e.g., [5]-[8]). Reference [6] discovered that certain commercial 802.11 devices are
deliberately manipulated to trick the access points into granting better rates. Such manipulation
can take the form of a non-uniform backoff procedure [6] or falsely reported traffic priorities [?].
As suggested in [5], in FDD cellular networks it is also possible to manipulate the devices' channel
feedback procedure, as compliance testing is usually limited to a few standardized scenarios. It is
projected that the users of future networks may have stronger ability (and incentives) to manipulate
their devices. See [5] for a detailed literature review in this direction. The presence of untruthful users
can significantly reduce the overall system performance and limit network access for truthful users.

2) Even assuming the users truthfully report their channel states, finding the globally optimal resource
allocation is still computationally intractable (this result will be shown in Section II).

3) There is no central entity to compute and enforce a desired user-network assignment and
network-wide resource allocation [4].

Consequently, a good resource allocation scheme must possess the following features: i) it should
provide efficient utilization of the spectrum; ii) it must be strategy-proof, i.e., it is in the users' best
interests to truthfully reveal their private information; iii) it should be implementable in a distributed
manner, in the sense that both the BSs and the users can take part in the scheme with only local
information and local computation.

A. Literature Review 

The problem of finding an optimal resource allocation for an OFDMA network when the user-BS
assignment is fixed and there is complete information on channel states has been widely studied, e.g.,
[9]-[11]. Various algorithms have been developed to solve the BS's utility maximization problems by
allocating the transmission power and the channels in optimal ways. The joint problem of BS assignment
and resource allocation in OFDMA networks has been analyzed under complete information and the ability
to enforce decisions from a centralized standpoint, for example, in [12], [13]. However, in many practical
networks there are no entities capable of performing the centralized decision making. Another strand
of the literature deals precisely with this case by using non-cooperative game theory [14]-[17]. Users
selfishly compute their power control and cell site selection strategies to maximize their own utilities.
With proper design of the utility functions, equilibrium solutions can be obtained in a distributed fashion.
However, complete information on the channel states and/or the utility functions is assumed, and the
overall efficiencies of the identified equilibrium solutions are not characterized.

There are many recent works that design mechanisms for resource allocation problems in networks
with strategic users and/or incomplete information [5], [18], [19]. It is commonly assumed that resources
are divisible and that there is a closed-form expression that describes the interdependency of users'
decisions. In contrast, in our problem the resource allocation decisions are mixed in nature, i.e., decisions
such as user-BS assignment and channel assignment are discrete (not divisible), while the BSs' power
allocation decision is continuous (divisible). In addition, the interdependency in users' decisions is
only implicitly characterized as the solution to the optimal resource management problem at each BS.
As a result, the problem considered in this paper does not fit into any of the frameworks
considered in the above cited papers. We mention that the recent work [?] considered an incomplete
information setting similar to ours, in which the FDD network lacks the true channel states due to
false reports by the users. The objective there, though, is to design optimal user scheduling algorithms,
which is different from the objective of the present paper.
Lower bounds on the efficiency of the Nash Equilibrium (NE), or the price of anarchy, have been
analyzed for network resource allocation games. Reference [20] considered a routing game in which the
inefficiency is due to the selfishness of the users. Reference [21] analyzed a network utility maximization
problem in which the strategic behavior of the users leads to efficiency loss. For both of the above cases
the optimal system level problems can be solved globally, whereas in our case the overall problem is
already difficult to solve. In [22], Vetta discussed the lower bounds of the NEs for a family of
non-cooperative games assuming a special structure of the users' utility functions. Applications of this
latter result in communication and sensor networks include [23] and [24]. However, these works use highly
stylized utility functions so that the result in [22] can be directly used.

B. Contributions 

The main contributions of our paper are summarized as follows. 

• We show that solving the joint BS assignment and resource allocation problem, even with truthful
users, is NP-hard. In most existing complexity results for resource management in wireless
communications (e.g., [25], [26]), the hardness of the problem is mainly due to the possibility of strong
interference among the users. In contrast, in our problem the hardness lies in its mixed (discrete
and continuous) formulation.

• The complexity of solving the joint problem optimally prevents the implementation of any
mechanism that is both efficient and strategy-proof. To ensure tractability, we design a novel
mechanism that implements an approximately optimal strategy. In the proposed mechanism each user
dynamically selects a BS to maximize its own utility, and the BSs implement the celebrated
Vickrey-Clarke-Groves (VCG) mechanism [27]-[29] to allocate resources under the current user-BS
assignment profile.

• We show that our mechanism achieves at least 1/2 of the optimal throughput. This result relies
on a key observation that the per-BS optimal throughput admits a certain submodular property.
Importantly, the obtained efficiency bounds hold true with or without the presence of untruthful
users. To the best of our knowledge, there has been no reported work that characterizes the
efficiency of the equilibrium solutions for the considered problem.

The rest of the paper is organized as follows. Section II formulates the problem and provides its
complexity status. Sections III and IV describe the mechanism for the resource management problem as
well as its distributed implementation. Section V gives some extensions of the algorithm. Section VI
provides simulation results. Section VII concludes the paper.

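Notations: We use bold-faced characters to denote vectors. We use $\mathbf{x}[i]$ to denote the $i$th
element of vector $\mathbf{x}$. We use $\mathbf{x}_{-i}$ to denote the vector
$[\mathbf{x}[1], \cdots, \mathbf{x}[i-1], \mathbf{x}[i+1], \cdots, \mathbf{x}[N]]$. We use
$[y, \mathbf{x}_{-i}]$ to denote the vector $\mathbf{x}$ with its $i$th element replaced by $y$. We use
$\vee$ to denote componentwise maximization:
$\mathbf{x} \vee \mathbf{y} = \{\mathbf{z} \mid \mathbf{z}[i] = \max\{\mathbf{x}[i], \mathbf{y}[i]\}, \forall\, i\}$.
$\mathcal{N} \setminus i$ defines a subset of $\mathcal{N}$:
$\mathcal{N} \setminus i = \{j : j \in \mathcal{N}, j \neq i\}$. The
main notations used in this paper are listed in Table I.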

II. System Model and Problem Formulation 

We consider a service area with a set $\mathcal{N} = \{1, 2, \cdots, N\}$ of users served by a set
$\mathcal{W} = \{1, 2, \cdots, W\}$ of BSs (or networks). Each BS $w$ operates on the set of channels
$\mathcal{K}_w$, with the bandwidth of each channel equally set to be $\Delta f$. Let
$\mathcal{K} = \cup_{w \in \mathcal{W}} \mathcal{K}_w$ denote the set of all channels. Suppose any two
channels do not overlap, i.e., $\mathcal{K}_w \cap \mathcal{K}_v = \emptyset$, $\forall\, w \neq v$. Such
an assumption is justified, for example, in the multi-technology HetNet or in the IEEE 802.22 cognitive
radio Wireless Regional Area Network (WRAN) [3]. In the latter network, a particular geographical region
may be served by multiple service providers (SPs), or by multiple Access Points (APs) installed by a
single SP. When operating in the "normal mode", the APs/SPs that serve the same region indeed operate on
non-overlapping portions of the available spectrum, by using proper spectrum etiquette protocols (see
Section 6.22 in [3]).

TABLE I

A List of Notations

Symbol                        Meaning
$\mathcal{N}$                 The set of all users
$\mathcal{W}$                 The set of all BSs
$\mathcal{K}$                 The set of all channels
$\mathcal{K}_w$               The set of channels belonging to BS $w$
$|h_i^k|^2$                   The channel gain of user $i$ on channel $k$
$p_w^k$                       The transmit power of BS $w$ on channel $k$
$\bar{p}_w$                   The power budget for BS $w$
$\beta_w^k$                   The user assignment of BS $w$ on channel $k$
$\mathbf{a}$                  The association profile
$\mathbf{a}_{-i}$             The association profile without user $i$
$r_i(\mathbf{a})$             The rate assigned to user $i$
$R_w(\mathbf{a})$             The optimal throughput of BS $w$ under $\mathbf{a}$
$R(\mathbf{a})$               The optimal system throughput under $\mathbf{a}$
$\mathcal{N}_w(\mathbf{a})$   The set of users associated to BS $w$
$U_i(\mathbf{a})$             The utility of user $i$
$T_i(\mathbf{a})$             The tax imposed on user $i$

Let $\{|h_i^k|^2\}_{k \in \mathcal{K}_w}$ denote the channel gains of the set of channels from BS $w$ to
user $i$. Let $\{n_i^k\}_{k \in \mathcal{K}}$ denote the set of measured noise powers at user $i$ on
different channels. Both the channel gains and the noise powers are considered as private information
to the users, as in the FDD mode they are measured at the mobile devices and then fed back to the BSs.

Define a length-$N$ vector $\mathbf{a}$ as the association profile in the network, with its $i$th element
$\mathbf{a}[i] = w$ indicating that user $i$ is associated to BS $w$. Define
$\mathbf{a}_{-i} = [\mathbf{a}[1], \cdots, \mathbf{a}[i-1], \mathbf{a}[i+1], \cdots, \mathbf{a}[N]]$ as an
association profile in which user $i$ drops out of the network. For each BS $w$, denote the set of
associated users as $\mathcal{N}_w(\mathbf{a}) = \{i : \mathbf{a}[i] = w\}$, which is a function of
$\mathbf{a}$.

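In a downlink OFDMA network, a BS $w \in \mathcal{W}$ can transmit to a single user
$i \in \mathcal{N}_w(\mathbf{a})$ on a given channel $k \in \mathcal{K}_w$. Let
$\boldsymbol{\beta}_w = \{\beta_w^k\}_{k \in \mathcal{K}_w}$ be a feasible channel assignment scheme for
BS $w$, i.e., $\beta_w^k = i \in \mathcal{N}_w(\mathbf{a})$ means channel $k$ is assigned to user $i$. Let
$\mathbf{p}_w = \{p_w^k\}_{k \in \mathcal{K}_w}$ be a feasible power allocation scheme for BS $w$:
$p_w^k \ge 0$, $\sum_{k \in \mathcal{K}_w} p_w^k \le \bar{p}_w$, where $\bar{p}_w$ is the power budget
for BS $w$. Let $\boldsymbol{\beta} = \{\boldsymbol{\beta}_w\}_{w \in \mathcal{W}}$ and
$\mathbf{p} = \{\mathbf{p}_w\}_{w \in \mathcal{W}}$.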

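Let us define $r_i(\boldsymbol{\beta}, \mathbf{p}, \mathbf{a})$ as the transmission rate that user $i$ can
obtain under the resource allocation scheme $(\boldsymbol{\beta}, \mathbf{p}, \mathbf{a})$. With
continuous rate adaptation, this rate can be expressed as:

$$r_i(\boldsymbol{\beta}, \mathbf{p}, \mathbf{a}) = \sum_{k \in \mathcal{K}_{\mathbf{a}[i]}} \log\left(1 + \frac{|h_i^k|^2 \, p_{\mathbf{a}[i]}^k}{\Gamma \, n_i^k}\right) \mathbb{1}\{\beta_{\mathbf{a}[i]}^k = i\} \tag{1}$$

where $\mathbb{1}\{\cdot\}$ is the indicator function and $\Gamma$ is the capacity gap, which is
determined by the target Bit Error Rate (BER) as $\Gamma = -\ln(5\,\mathrm{BER})/1.5$ (see [30]).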

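The objective of the resource allocation is to find the tuple
$(\boldsymbol{\beta}, \mathbf{p}, \mathbf{a})$ that achieves efficient spectrum utilization within each
BS/network while balancing the loads across different BSs/networks. Mathematically, we formulate the
overall resource allocation problem as follows:

$$\max_{\mathbf{a}, \boldsymbol{\beta}, \mathbf{p}} \ \sum_{w \in \mathcal{W}} \alpha_w \sum_{i \in \mathcal{N}_w(\mathbf{a})} r_i(\boldsymbol{\beta}, \mathbf{p}, \mathbf{a}) \tag{SYS}$$

$$\text{s.t.} \quad \mathbf{a}[i] \in \mathcal{W}, \ \forall\, i \in \mathcal{N},$$

$$\beta_w^k \in \mathcal{N}_w(\mathbf{a}), \ \forall\, k \in \mathcal{K}_w, \ \forall\, w \in \mathcal{W},$$

$$p_w^k \ge 0, \ \sum_{k \in \mathcal{K}_w} p_w^k \le \bar{p}_w, \ \forall\, w \in \mathcal{W}.$$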

The load balancing property of this formulation is manifested by: 1) introducing the association as
a decision variable for the users; 2) including the weighting factors $\{\alpha_w > 0\}_{w=1}^{W}$ in the
objective. The first factor enables the users to effectively avoid congestion by switching to lightly
loaded BSs in a timely fashion, while the second factor allows the network operator to further shift the
traffic to the BSs with larger weights.

A. The BSs' Resource Management: Optimal Channel Assignment and Power Allocation

We first describe each BS's optimal resource management strategy. Let us assume that each BS $w$ has
perfect knowledge of the downlink normalized channel states of its associated users,
$\mathbf{h}_w = \{|h_i^k|^2 / n_i^k\}_{i \in \mathcal{N}_w(\mathbf{a}),\, k \in \mathcal{K}_w}$
(this assumption will be relaxed later).

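First consider a simple case where an equal power allocation strategy is used, that is,
$p_w^k = \bar{p}_w / |\mathcal{K}_w|$, $\forall\, k \in \mathcal{K}_w$. Each BS $w$ then optimizes its
throughput by picking a suitable user to serve on each channel. Mathematically, it solves the following
channel assignment (CA) problem:

$$\max_{\boldsymbol{\beta}_w} \ \alpha_w \sum_{i \in \mathcal{N}_w(\mathbf{a})} r_i(\boldsymbol{\beta}, \mathbf{p}, \mathbf{a}) \tag{CA}$$

$$\text{s.t.} \quad \beta_w^k \in \mathcal{N}_w(\mathbf{a}), \ \forall\, k \in \mathcal{K}_w.$$

The optimal solution to this problem is to assign each channel to the best user [?]:

$$(\beta_w^k)^* = i^*, \quad \text{where } i^* \in \arg\max_{i \in \mathcal{N}_w(\mathbf{a})} \frac{|h_i^k|^2}{n_i^k}. \tag{2}$$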
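The CA rule can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `ca_allocate`, the dictionary layout of `gains` and `noise`, and the two-user instance are hypothetical and not taken from the paper; rates are in nats.

```python
import math

def ca_allocate(gains, noise, users, channels, p_bar, gap=1.0, alpha=1.0):
    """Equal-power channel assignment (CA) for a single BS.

    gains[i][k] and noise[i][k] are user i's channel gain and noise power
    on channel k. Returns (assignment, weighted_throughput).
    """
    power = p_bar / len(channels)  # equal split of the power budget
    assignment, throughput = {}, 0.0
    for k in channels:
        # Give channel k to the user with the best normalized channel gain.
        best = max(users, key=lambda i: gains[i][k] / noise[i][k])
        assignment[k] = best
        snr = gains[best][k] * power / (gap * noise[best][k])
        throughput += alpha * math.log(1.0 + snr)
    return assignment, throughput

# Hypothetical two-user, two-channel instance with unit noise.
gains = {"a": [4.0, 1.0], "b": [2.0, 3.0]}
noise = {"a": [1.0, 1.0], "b": [1.0, 1.0]}
assignment, throughput = ca_allocate(gains, noise, ["a", "b"], [0, 1], p_bar=2.0)
print(assignment, throughput)
```

Because the power is fixed, the per-channel decisions decouple, which is why a single pass over the channels suffices.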

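On the other hand, if a BS can optimize both its channel assignment and power allocation, then
BS $w$ solves the following channel assignment and power allocation (CAPA) problem:

$$\max_{\boldsymbol{\beta}_w, \mathbf{p}_w} \ \alpha_w \sum_{i \in \mathcal{N}_w(\mathbf{a})} r_i(\boldsymbol{\beta}, \mathbf{p}, \mathbf{a}) \tag{CAPA}$$

$$\text{s.t.} \quad \sum_{k \in \mathcal{K}_w} p_w^k \le \bar{p}_w, \quad p_w^k \ge 0, \quad \beta_w^k \in \mathcal{N}_w(\mathbf{a}), \ \forall\, k \in \mathcal{K}_w.$$

The optimal solution to this problem can be written in closed form [11]:

$$(\beta_w^k)^* = i^*, \quad \text{where } i^* \in \arg\max_{i \in \mathcal{N}_w(\mathbf{a})} \frac{|h_i^k|^2}{n_i^k},$$

$$(p_w^k)^* = \left[\frac{\alpha_w}{\lambda} - \frac{\Gamma \, n_{i^*}^k}{|h_{i^*}^k|^2}\right]^+, \qquad \lambda \Big(\sum_{k \in \mathcal{K}_w} (p_w^k)^* - \bar{p}_w\Big) = 0, \tag{3}$$

where $[x]^+ = \max\{x, 0\}$ and $\lambda \ge 0$ is the dual variable associated with the power budget
constraint. We note that the power constraint is binding at the optimal solution:
$\sum_{k \in \mathcal{K}_w} (p_w^k)^* = \bar{p}_w$.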
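The CAPA solution has the familiar water-filling structure, which the following Python sketch illustrates. It assumes unit noise and finds the water level by bisection rather than by solving for the dual variable directly; the names `water_fill` and `capa_throughput` and the numerical inputs are hypothetical.

```python
import math

def water_fill(c, p_bar, iters=200):
    """Maximize sum_k log(1 + c[k] * p[k]) s.t. sum(p) <= p_bar, p >= 0.

    c[k] > 0 is the best user's gain on channel k divided by (gap * noise).
    The optimum is p[k] = max(0, mu - 1 / c[k]); bisection finds the level mu.
    """
    lo, hi = 0.0, p_bar + max(1.0 / ck for ck in c)
    for _ in range(iters):
        mu = 0.5 * (lo + hi)
        used = sum(max(0.0, mu - 1.0 / ck) for ck in c)
        if used > p_bar:
            hi = mu
        else:
            lo = mu
    # Using the lower end keeps the allocation within the budget.
    return [max(0.0, lo - 1.0 / ck) for ck in c]

def capa_throughput(gains, users, p_bar, gap=1.0, alpha=1.0):
    """CAPA for one BS with unit noise: best user per channel, then water-filling."""
    n_channels = len(gains[users[0]])
    best = [max(gains[i][k] for i in users) / gap for k in range(n_channels)]
    power = water_fill(best, p_bar)
    return alpha * sum(math.log(1.0 + ck * pk) for ck, pk in zip(best, power))

p_equal = water_fill([2.0, 2.0, 2.0], 3.0)   # identical channels: equal split
p_skewed = water_fill([1.0, 0.1], 1.0)       # weak channel receives no power
rate = capa_throughput({"a": [2.0, 2.0, 1.0], "b": [0.5, 0.5, 2.0]}, ["a", "b"], 3.0)
print(p_equal, p_skewed, rate)
```

The water level `mu` plays the role of the (scaled) inverse of the dual variable of the power budget constraint.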

With some abuse of notation, we use $r_i(\mathbf{a})$ to denote the optimal rate for user $i$ obtained by
using either the CA or the CAPA strategy (the actual strategy used will be indicated using a superscript
CA or CAPA when necessary). When $\mathbf{a}$ is fixed, we denote the weighted optimal throughput of
BS $w$ by:

$$R_w(\mathbf{a}) = \alpha_w \sum_{i \in \mathcal{N}_w(\mathbf{a})} r_i(\mathbf{a}).$$

B. The Overall Resource Management Problem: Complexity Status 

In this subsection, we investigate the complexity status of the throughput optimization problem (SYS).
A tuple $(\boldsymbol{\beta}^*, \mathbf{p}^*, \mathbf{a}^*)$ is an optimal solution of the problem (SYS)
only if each BS uses the CAPA strategy. Although finding the CAPA solution is easy when the user-BS
association is fixed, the problem turns out to be intractable when the association becomes an
optimization variable, as stated in the following theorem. Its proof is provided in the Appendix.

Theorem 1: Finding the optimal solution to the problem (SYS) is strongly NP-hard. 

When each BS is restricted to allocate resource by channel assignment only, then the CA strategy is 
optimal for the per-BS problem. In this case, the problem (SYS) is still intractable. 
Corollary 1: When the CA strategy is used for resource allocation within each BS, finding the best
BS association that maximizes the system throughput is strongly NP-hard.

The above results establish the complexity status for the problem (SYS) in a general network 
configuration. In a special case where each BS operates on a single channel, the problem is equivalent 
to a maximum weighted matching problem, which is efficiently solvable. 

Corollary 2: When $|\mathcal{K}_w| = 1$, $\forall\, w \in \mathcal{W}$, problem (SYS) is polynomial time solvable.

III. Mechanism Design for Joint BS Association and Resource Allocation 

The previous section analyzes the per-BS and the overall resource allocation problems assuming
complete information at the network side. However, in an FDD network, there is an intrinsic asymmetry
in the available information at the BSs and at the users, as the downlink channels and the noise powers
are measured by the users. Strategic (selfish) users can exploit such asymmetry of information for their
own benefit, by tampering with the devices if necessary [5]. We now provide a simple illustration of
the potential inefficiency caused by the manipulation of channel state information.

Example 1: Consider a network consisting of 1 BS and 2 users with 3 channels. Let the noise
power $n_i^k = 1$ for all $i, k$, and let $\Gamma = 1$. Assume that the BS has a total power of 3. The
channel gains are listed in Table II.

TABLE II

Channel Gains for Example 1

                  $|h_i^1|^2$   $|h_i^2|^2$   $|h_i^3|^2$
user 1 (i = 1)    2             2             1
user 2 (i = 2)    0.5           0.5           2



When all the users report truthfully, and when the CA strategy is used, user 1 will be scheduled on
channels 1 and 2, while user 2 will be scheduled on channel 3. A throughput of
$3\log(1 + 2) \approx 3.30$ nats/s can be obtained. When user 1 remains truthful but user 2 becomes
selfish and falsely reports its channels as (3, 3, 2) (instead of (0.5, 0.5, 2)), the BS will assign all
the channels to user 2. After the channel assignment, the actual rate that user 2 obtains still depends
on its true channels¹. Thus a throughput of $2\log(1 + 0.5) + \log(1 + 2) \approx 1.91$ nats/s will be
obtained, which is only about 58% of the optimal system throughput. In contrast, user 2's untruthful
behavior leads to its own rate increase of over 70%, at the expense of starving user 1. ■
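The numbers in Example 1 can be reproduced with a short script. The sketch below assumes equal power of 1 per channel, unit noise and unit capacity gap, as stated in the example; the helper name `ca_outcome` is hypothetical.

```python
import math

TRUE_GAINS = {1: [2.0, 2.0, 1.0], 2: [0.5, 0.5, 2.0]}

def ca_outcome(reported, actual, power=1.0):
    """Assign each channel by the reported gains; rates use the actual gains."""
    rates = {i: 0.0 for i in actual}
    for k in range(3):
        winner = max(reported, key=lambda i: reported[i][k])
        rates[winner] += math.log(1.0 + actual[winner][k] * power)
    return rates

truthful = ca_outcome(TRUE_GAINS, TRUE_GAINS)
lying = ca_outcome({1: TRUE_GAINS[1], 2: [3.0, 3.0, 2.0]}, TRUE_GAINS)

total_truthful = sum(truthful.values())   # 3 log 3
total_lying = sum(lying.values())         # 2 log 1.5 + log 3
print(f"truthful throughput: {total_truthful:.4f} nats/s")
print(f"throughput after misreport: {total_lying:.4f} nats/s")
print(f"efficiency: {total_lying / total_truthful:.1%}")
print(f"user 2 rate gain: {lying[2] / truthful[2] - 1.0:.1%}")
```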

A. The VCG mechanism 

The optimization of system performance when strategic users have private information can be formu- 
lated as a mechanism design problem. Assuming users have quasilinear utility, a system of incentives 
(interference taxes) may be put in place in order to align individual users' preferences with the goal 
of optimizing system performance. The goal therefore is to find the interference taxes that support the 
implementation of efficient resource allocation in dominant strategies, i.e. for each user, the truthful 
revelation of channel state information is optimal regardless of the information reported by all other 
users. The search for mechanisms is typically restricted to the class of direct mechanisms in which 
users report their private information to a third-party, which in turn allocates resources and implements 
a system of incentives via taxes. 

¹ Such a rate can be achieved via the use of rateless codes. We refer the readers to [5, Section II] for
a detailed explanation of achieving such a rate when BSs do not have perfect knowledge of the actual channel.
The celebrated VCG mechanism achieves this goal by having users report their privately held
information on channel states to a central controller (CU), which computes the globally optimal solution
of (SYS) given the reported information. The CU then assigns the users to the BSs and allocates their
rates according to the optimal solution of (SYS). Each user, when attempting to manipulate the
allocation of resources by misreporting channel state information, is penalized for the deterioration of
system performance for all other users. This is the basis for the VCG mechanism being strategy-proof,
i.e., for each individual user, truthful revelation of channel state information is optimal regardless of
the information reported by all other users. It should also be emphasized here that any other direct and
strategy-proof mechanism implementing the solution to (SYS) is an instance of the VCG mechanism
with interference taxes modified by a constant (see [31, Corollary 5.1]). Unfortunately, in the previous
section we showed that finding the globally optimal solution of (SYS) is an NP-hard problem. Thus,
unless P = NP, a computationally tractable direct mechanism cannot be both strategy-proof and efficient.

Our strategy for designing a computationally tractable mechanism is to relax the requirement of
optimality so that an approximately optimal solution of (SYS) can be implemented in dominant strategies.
In the mechanism proposed below, tractability is achieved by: (i) decentralizing the resource allocation
decisions to each BS and implementing the VCG mechanism on a per-BS basis (Section III-B below);
(ii) allowing the users to dynamically adjust their choices of association (Section III-C below).

B. Implementing VCG at each BS with Fixed Association 

We formally describe the implementation of the VCG mechanism for a given user-BS association
profile $\mathbf{a}$. Recall that the optimal per-BS strategies were described in Section II-A.

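Define the normalized channels as $\mathbf{h}_{i,w} = \{|h_i^k|^2 / n_i^k\}_{k \in \mathcal{K}_w}$ and
$\mathbf{h}_{-i,w} = \{\mathbf{h}_{j,w}\}_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i}$. Let
$\mathbf{h}_w = [\mathbf{h}_{i,w}, \mathbf{h}_{-i,w}]$.

Define user $i$'s reported normalized channels as $\hat{\mathbf{h}}_{i,w}$. Define
$\hat{\mathbf{h}}_{-i,w}$ and $\hat{\mathbf{h}}_w$ similarly. When we take untruthfulness into
consideration, a user $i$'s rate depends on the following two terms: 1) the reported normalized
channels, denoted as $\hat{\mathbf{h}}_w$, by which the BS makes the resource allocation decision; 2)
the actual normalized channel $\mathbf{h}_{i,w}$, by which user $i$ experiences the actual rate. We
signify such dependencies by using $r_i(\mathbf{a}; \mathbf{h}_{i,w}, \hat{\mathbf{h}}_w)$ to denote
user $i$'s rate. If the information reported by the users is $\hat{\mathbf{h}}_w$, a tax
$T_i(\mathbf{a}; \hat{\mathbf{h}}_w)$ will be levied upon user $i$, and its net utility is

$$U_i(\mathbf{a}; \mathbf{h}_{i,w}, \hat{\mathbf{h}}_w) = \alpha_w r_i(\mathbf{a}; \mathbf{h}_{i,w}, \hat{\mathbf{h}}_w) - T_i(\mathbf{a}; \hat{\mathbf{h}}_w). \tag{4}$$

The tax assessed on user $i$ is computed based on the reported channels. It is given as the total rate
improvement that the set of remaining users $\mathcal{N}_w(\mathbf{a}) \setminus i$ can obtain if user
$i$ leaves BS $w$:

$$T_i(\mathbf{a}; \hat{\mathbf{h}}_w) = \underbrace{\sum_{j \in \mathcal{N}_w(\mathbf{a}_{-i})} \alpha_w r_j(\mathbf{a}_{-i}; \hat{\mathbf{h}}_{j,w}, \hat{\mathbf{h}}_{-i,w})}_{\text{optimal throughput when } i \text{ drops out}} - \underbrace{\sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} \alpha_w r_j(\mathbf{a}; \hat{\mathbf{h}}_{j,w}, \hat{\mathbf{h}}_w)}_{\text{optimal throughput of other users when } i \text{ is present}}. \tag{5}$$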

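The tax expressed in (5) ensures that each user has an incentive to act truthfully. To see why,
note that when all the users truthfully report their channels, BS $w$ can maximize its throughput. For
example, the optimal throughput is no smaller than that achieved when user $i$ reports untruthfully:

$$\sum_{j \in \mathcal{N}_w(\mathbf{a})} r_j(\mathbf{a}; \mathbf{h}_{j,w}, \mathbf{h}_w) \ge \sum_{j \in \mathcal{N}_w(\mathbf{a})} r_j(\mathbf{a}; \mathbf{h}_{j,w}, [\hat{\mathbf{h}}_{i,w}, \mathbf{h}_{-i,w}]). \tag{6}$$

From this property we can derive the following key inequality:

$$r_i(\mathbf{a}; \mathbf{h}_{i,w}, \hat{\mathbf{h}}_w) + \sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} r_j(\mathbf{a}; \hat{\mathbf{h}}_{j,w}, \hat{\mathbf{h}}_w) \le r_i(\mathbf{a}; \mathbf{h}_{i,w}, [\mathbf{h}_{i,w}, \hat{\mathbf{h}}_{-i,w}]) + \sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} r_j(\mathbf{a}; \hat{\mathbf{h}}_{j,w}, [\mathbf{h}_{i,w}, \hat{\mathbf{h}}_{-i,w}]). \tag{7}$$

The sum of the two terms on the right-hand side of (7) represents the sum rate achieved by all
the users in BS $w$ assuming the actual channels are $[\mathbf{h}_{i,w}, \hat{\mathbf{h}}_{-i,w}]$ and
the users also truthfully report $[\mathbf{h}_{i,w}, \hat{\mathbf{h}}_{-i,w}]$. Consequently, this
inequality follows from the fact that the BS's resource management strategy maximizes the sum rate when
the channels are truthfully reported.

Now suppose user $i$ reports untruthfully (i.e., $\hat{\mathbf{h}}_{i,w} \neq \mathbf{h}_{i,w}$); then we have

$$U_i(\mathbf{a}; \mathbf{h}_{i,w}, \hat{\mathbf{h}}_w) \overset{(a)}{=} \alpha_w r_i(\mathbf{a}; \mathbf{h}_{i,w}, \hat{\mathbf{h}}_w) - \Big( \sum_{j \in \mathcal{N}_w(\mathbf{a}_{-i})} \alpha_w r_j(\mathbf{a}_{-i}; \hat{\mathbf{h}}_{j,w}, \hat{\mathbf{h}}_{-i,w}) - \sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} \alpha_w r_j(\mathbf{a}; \hat{\mathbf{h}}_{j,w}, \hat{\mathbf{h}}_w) \Big)$$

$$\overset{(b)}{\le} \alpha_w r_i(\mathbf{a}; \mathbf{h}_{i,w}, [\mathbf{h}_{i,w}, \hat{\mathbf{h}}_{-i,w}]) + \sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} \alpha_w r_j(\mathbf{a}; \hat{\mathbf{h}}_{j,w}, [\mathbf{h}_{i,w}, \hat{\mathbf{h}}_{-i,w}]) - \sum_{j \in \mathcal{N}_w(\mathbf{a}_{-i})} \alpha_w r_j(\mathbf{a}_{-i}; \hat{\mathbf{h}}_{j,w}, \hat{\mathbf{h}}_{-i,w})$$

$$\overset{(c)}{=} U_i(\mathbf{a}; \mathbf{h}_{i,w}, [\mathbf{h}_{i,w}, \hat{\mathbf{h}}_{-i,w}])$$

where (a) and (c) follow from the definitions of the utility and the tax in (4) and (5), and (b) follows
from (7) after multiplying both sides by $\alpha_w > 0$. Hence truthful reporting is a dominant strategy
for user $i$, which establishes the desired result.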

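In the remainder of this paper, we will assume that the VCG mechanism is implemented at each BS.
Thus, we will simply write $r_i(\mathbf{a})$ instead of $r_i(\mathbf{a}; \mathbf{h}_{i,w}, \mathbf{h}_w)$.
User $i$'s tax term (5) and utility term (4) can be simplified as (assuming $\mathbf{a}[i] = w$)

$$T_i(\mathbf{a}) = \sum_{j \in \mathcal{N}_w(\mathbf{a}_{-i})} \alpha_w r_j(\mathbf{a}_{-i}) - \sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} \alpha_w r_j(\mathbf{a}) \tag{8}$$

$$U_i(\mathbf{a}) = \alpha_w r_i(\mathbf{a}) - T_i(\mathbf{a}) = \sum_{j \in \mathcal{N}_w(\mathbf{a})} \alpha_w r_j(\mathbf{a}) - \sum_{j \in \mathcal{N}_w(\mathbf{a}_{-i})} \alpha_w r_j(\mathbf{a}_{-i}). \tag{9}$$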

In summary, by using the VCG mechanism within each BS, all the users will act truthfully, which in
turn allows the BSs to optimally implement their resource allocation strategies. It is important to note
here that even in the ideal scenario where all the users behave truthfully, the tax and utility functions
defined in (8) and (9) are still extremely useful. As will be seen in the subsequent sections, they lead
to simple and efficient network-wide resource allocation.
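To make the per-BS tax concrete, the following Python sketch evaluates the tax and the net utility for the channel gains of Example 1 under the CA strategy (unit power per channel, unit noise, unit weight). The function names `ca_rates` and `vcg_utility` are hypothetical, and the sketch covers only the single-BS, CA case.

```python
import math

def ca_rates(assign_gains, rate_gains, users, power=1.0):
    """Assign each channel to the user with the best gain in assign_gains,
    then evaluate the rates with rate_gains (unit noise, unit capacity gap)."""
    rates = {i: 0.0 for i in users}
    if not users:
        return rates
    n_channels = len(assign_gains[users[0]])
    for k in range(n_channels):
        winner = max(users, key=lambda i: assign_gains[i][k])
        rates[winner] += math.log(1.0 + rate_gains[winner][k] * power)
    return rates

def vcg_utility(true_gains, reported, users, i, alpha=1.0):
    """Return (net utility, tax) of user i; the tax uses reported channels only."""
    others = [j for j in users if j != i]
    with_i = ca_rates(reported, reported, users)
    without_i = ca_rates(reported, reported, others)
    tax = alpha * (sum(without_i.values()) - sum(with_i[j] for j in others))
    actual_rate = ca_rates(reported, true_gains, users)[i]
    return alpha * actual_rate - tax, tax

true_gains = {1: [2.0, 2.0, 1.0], 2: [0.5, 0.5, 2.0]}
misreport = {1: true_gains[1], 2: [3.0, 3.0, 2.0]}

truth_utility, truth_tax = vcg_utility(true_gains, true_gains, [1, 2], 2)
lie_utility, lie_tax = vcg_utility(true_gains, misreport, [1, 2], 2)
print(truth_utility, truth_tax)   # log(1.5), log(2)
print(lie_utility, lie_tax)       # negative utility: the misreport backfires
```

With the tax in place, user 2's misreport from Example 1 yields a negative net utility, whereas truthful reporting yields a positive one.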

C. The User-BS Association Game 

Suppose users are allowed to autonomously select which BS to connect to. Assuming each BS
implements a VCG mechanism, we are left with a user-BS association game. We will occasionally use
the superscripts CA or CAPA to specify the strategies used by the BSs. Let us define a non-cooperative
BS association game as
$\mathcal{G} = \{\mathcal{N}, \{\chi_i\}_{i \in \mathcal{N}}, \{U_i(\cdot)\}_{i \in \mathcal{N}}\}$,
where $\chi_i = \mathcal{W}$ is the strategy space of user $i$ and $U_i(\cdot)$ is the utility of user
$i$ as defined in (9).

Interestingly, unlike most conventional games, in game $\mathcal{G}$ the interdependencies of the users'
strategies are only implicitly given. For example, suppose $\mathbf{a}[i] = w$ and $\mathbf{a}[j] = q$.
In order to assess the impact of user $i$'s change of association from BS $w$ to BS $q$ on user $j$'s
utility, BS $q$'s resource allocation problem (either CA or CAPA) needs to be solved. There is no
closed-form expression governing the users' interdependencies. This unique property of the game makes
our subsequent analysis, particularly that of the efficiency of the NE of game $\mathcal{G}$, very involved.

We first characterize the utility function $U_i(\mathbf{a})$ and the tax function $T_i(\mathbf{a})$.

Proposition 1: When all the BSs use either the CA or the CAPA strategy, $T_i(\mathbf{a}) \ge 0$,
$\forall\, i \in \mathcal{N}$. Moreover, the users' utility functions are bounded:
$0 \le U_i(\mathbf{a}) \le \alpha_{\mathbf{a}[i]} r_i(\mathbf{a})$.

Proof: For notational simplicity, let $w = \mathbf{a}[i]$ be the association of user $i$. Suppose that
the BSs use the CA strategy. Observe that each channel vacated by user $i$'s departure will be re-assigned
to some user $j \in \mathcal{N}_w(\mathbf{a}) \setminus i$, and the assignment of all the other channels
remains the same. Consequently, the rates of those users that have new channels will increase. This
leads to $T_i^{CA}(\mathbf{a}) \ge 0$, which in turn implies that
$U_i^{CA}(\mathbf{a}) \le \alpha_w r_i^{CA}(\mathbf{a})$. To show $U_i^{CA}(\mathbf{a}) \ge 0$, observe

$$\max_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} \frac{|h_j^k|^2}{n_j^k} \le \max_{j \in \mathcal{N}_w(\mathbf{a})} \frac{|h_j^k|^2}{n_j^k}, \quad \forall\, k \in \mathcal{K}_w. \tag{10}$$

This inequality, combined with the monotonicity of the log function and the structure of the solution
for the CA problem (cf. (2)), implies:

$$\sum_{j \in \mathcal{N}_w(\mathbf{a}_{-i})} \alpha_w r_j^{CA}(\mathbf{a}_{-i}) \le \alpha_w r_i^{CA}(\mathbf{a}) + \sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} \alpha_w r_j^{CA}(\mathbf{a}). \tag{11}$$

Rearranging terms, and plugging in the definition of the tax in (8), we have
$\alpha_w r_i^{CA}(\mathbf{a}) - T_i^{CA}(\mathbf{a}) = U_i^{CA}(\mathbf{a}) \ge 0$. The CAPA case can be
argued along the same lines. ■

D. Characterization of Nash Equilibria 

In this subsection we present a series of results characterizing the pure NEs for game $\mathcal{G}$.
For a fixed $\mathbf{a}$, we define $BR_i(\mathbf{a})$ as the set of "better-reply" BSs for user $i$:

$$BR_i(\mathbf{a}) = \{w \mid U_i([w, \mathbf{a}_{-i}]) > U_i([\mathbf{a}[i], \mathbf{a}_{-i}]),\ w \in \mathcal{W}\}. \tag{12}$$

The pure strategy NE of the game $\mathcal{G}$ is a profile $\mathbf{a}^*$ in which
$BR_i(\mathbf{a}^*) = \emptyset$, $\forall\, i \in \mathcal{N}$. Equivalently, all users prefer to stay
in their current BSs:
$U_i(\mathbf{a}^*) \ge \max_{w \in \mathcal{W}} U_i([w, \mathbf{a}^*_{-i}])$, $\forall\, i \in \mathcal{N}$.

Let $R(\mathbf{a}) = \sum_{w \in \mathcal{W}} R_w(\mathbf{a})$ denote the weighted system throughput for
a fixed association $\mathbf{a}$. Our first result analyzes the existence of the pure NE of game
$\mathcal{G}$. The proof can be found in the Appendix.

Theorem 2: The game $\mathcal{G}$ must admit at least one pure NE. In particular, the association profile
$\mathbf{a}^* \in \arg\max_{\mathbf{a}} R(\mathbf{a})$ must be a pure NE of this game.
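The better-reply sets and the NE condition can be explored numerically. The following Python sketch uses a hypothetical instance (three users, two BSs with two channels each, CA strategy with unit power per channel, unit noise and unit weights) and the utility of (9), which equals the change in a BS's throughput caused by the user's presence. All names and numbers below are illustrative assumptions.

```python
import math
from itertools import product

# Hypothetical instance: GAINS[i][k] is user i's gain on channel k.
# BS 0 owns channels 0-1 and BS 1 owns channels 2-3.
GAINS = [[1.5, 0.3, 0.9, 2.0], [0.4, 1.8, 1.1, 0.2], [2.2, 0.6, 0.7, 1.4]]
CHANNELS = [[0, 1], [2, 3]]

def members_of(bs, profile):
    return [j for j, b in enumerate(profile) if b == bs]

def bs_throughput(bs, members):
    """CA throughput of one BS: each channel serves its best associated user."""
    if not members:
        return 0.0
    return sum(math.log(1.0 + max(GAINS[i][k] for i in members)) for k in CHANNELS[bs])

def utility(i, profile):
    """User i's rate minus its tax: throughput with i minus throughput without i."""
    bs = profile[i]
    members = members_of(bs, profile)
    return bs_throughput(bs, members) - bs_throughput(bs, [j for j in members if j != i])

def system_throughput(profile):
    return sum(bs_throughput(bs, members_of(bs, profile)) for bs in range(len(CHANNELS)))

def better_reply(i, profile, tol=1e-9):
    """Return a profile in which user i strictly improves, or None."""
    for bs in range(len(CHANNELS)):
        trial = profile[:i] + (bs,) + profile[i + 1:]
        if utility(i, trial) > utility(i, profile) + tol:
            return trial
    return None

def is_nash(profile):
    return all(better_reply(i, profile) is None for i in range(len(profile)))

def better_reply_dynamics(profile):
    """Let users switch one at a time until nobody has a better reply."""
    while True:
        for i in range(len(profile)):
            trial = better_reply(i, profile)
            if trial is not None:
                profile = trial
                break
        else:
            return profile

optimum = max(product(range(len(CHANNELS)), repeat=len(GAINS)), key=system_throughput)
equilibrium = better_reply_dynamics((0, 0, 0))
print(optimum, is_nash(optimum), equilibrium, is_nash(equilibrium))
```

In this sketch every unilateral switch changes a user's utility by exactly the change in system throughput, so the dynamics must terminate and the throughput-maximizing profile is an equilibrium, in line with Theorem 2.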

The existence of a pure NE for the game $\mathcal{G}$ can be attributed to the tax charged by the BSs.
Without such a tax, there may be no pure NE. To illustrate, define a new game in which users are not
taxed and their utilities are just their rates:
$\bar{\mathcal{G}} = \{\mathcal{N}, \{\chi_i\}_{i \in \mathcal{N}}, \{r_i(\cdot)\}_{i \in \mathcal{N}}\}$.
We claim that if all the BSs use either the CA or the CAPA strategy, this game does not always admit a
pure NE. We show this claim by giving two counterexamples.



Example 2: When the BSs use the CA strategy, consider a network with $W = 2$, $N = 3$, $\Gamma = 1$
and $|\mathcal{K}_w| = 2$, $\forall\, w$. The channel gains are given in Table III. Let $n_i^k = 1$,
$\forall\, i, k$, and $\bar{p}_w = 2$, $\forall\, w$. When the BSs use the CAPA strategy, consider a
network with $W = 2$, $N = 3$, $\Gamma = 1$ and $|\mathcal{K}_w| = 2$, $\forall\, w$. The channel gains
are given in Table IV. Let $n_i^k = 1$, $\forall\, i, k$, and $\bar{p}_w = 5$, $\forall\, w$. For both
examples, we show in Table V that in every possible association profile, there exists at least one user
whose better-reply set is nonempty. ■
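The CA counterexample can be checked by enumerating all eight association profiles. The Python sketch below uses the four gain values per user listed for the CA case and assumes, since the column headers are not legible in this copy, that the first two values belong to the channels of BS 1 and the last two to the channels of BS 2. With $\bar{p}_w = 2$ and two channels per BS, each channel carries unit power. The function names are hypothetical.

```python
import math
from itertools import product

# Gains for the CA case; the split of columns between the BSs is an assumption.
GAINS = {1: [2.0, 0.1, 2.2, 0.1], 2: [0.5, 2.5, 0.1, 2.6], 3: [0.1, 2.4, 2.3, 0.2]}
CHANNELS = {1: [0, 1], 2: [2, 3]}   # BS -> column indices (assumed)

def rate(user, profile):
    """Rate of `user` under CA with unit power per channel and no tax."""
    bs = profile[user - 1]
    members = [j for j in GAINS if profile[j - 1] == bs]
    total = 0.0
    for k in CHANNELS[bs]:
        if max(members, key=lambda j: GAINS[j][k]) == user:
            total += math.log(1.0 + GAINS[user][k])
    return total

def better_replies(profile):
    """Map each user with a strictly better BS to that BS."""
    found = {}
    for user in GAINS:
        for bs in CHANNELS:
            trial = list(profile)
            trial[user - 1] = bs
            if rate(user, tuple(trial)) > rate(user, profile) + 1e-12:
                found[user] = bs
    return found

profiles = list(product([1, 2], repeat=3))
no_pure_ne = all(better_replies(p) for p in profiles)
for p in profiles:
    print(p, better_replies(p))
print("no pure NE without taxes:", no_pure_ne)
```

Under this assumed channel-to-BS mapping, every profile has a user who gains by switching, and the destination BS matches the one listed for each profile in Table V.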



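TABLE III

Channel Gains for Example 2, CA Case

(The column headers are not recoverable; each row lists the user's gains on the four channels.)

         column 1   column 2   column 3   column 4
i = 1    2          0.1        2.2        0.1
i = 2    0.5        2.5        0.1        2.6
i = 3    0.1        2.4        2.3        0.2

TABLE IV

Channel Gains for Example 2, CAPA Case

(The table entries are not recoverable from the source.)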

Example ID illustrates that it is the interference tax imposed by the BSs that ensures the existence of 
the pure NE for game Q. In fact, such tax also guarantees the efficiency of the outcome of the game. 
Theorem |2] asserts that the maximum weighted throughput achievable by all the NEs is the same as 
the optimal system weighted throughput. In the following, we further provide a lower bound for the 
efficiency of the NEs. Central to the derivation of such lower bound is certain submodular property of the 
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TABLE V 

The Better-Reply Sets for uses Under Different System Association Profiles. 



Association Profile 


Better-Reply Set (CA Example) 


Better-Reply Set (CAPA Example) 


[1,1,1] 


= 2 


[1,1,1]) = 2 


[1,1,2] 


i3i?^'^([l,l,2]) = 2 


BR^^^^^{[1,1,2]) = 2 


[1,2,2] 


Si?^^([l,2,2]) = 1 


BR'^^^^{[1,2,2]) ^ 1 


[1,2,1] 


BR\:'^ {[1,2,1]) ^ 2 


Bi?^'^^^([l,2,l]) ^ 2 


[2,2,1] 


i3i?^^([2,2,l]) = l 


Bi?^'^^'^([2,2,l]) = 1 


[2,1,1] 


i3i?^'^([2,l,l]) = 2 


BR^'^^^{[2,1,1]) 2 


[2,1,2] 


Bi?^^([2,l,2]) = 1 


5^c.vi/^A([2,i,2]) = 1 


[2,2,2] 


Bi?^'^([2,2,2]) = l 


Bi?5r''^^^([2,2,2]) = 1 



per-BS throughput function Rui{-)- Note that i?u,(a) depends on the association profile a only through 
the set of associated users J\fw{sL). We can then rewrite i?«,(a) as i?^(A/"i„(a)), which is expressed as a 
function of the set of associated users. Then we say that is submodular if the following is true 

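$$ R_w(\mathcal{G} \cup \{i\}) - R_w(\mathcal{G}) \le R_w(\mathcal{M} \cup \{i\}) - R_w(\mathcal{M}), \quad \forall\, i \in \mathcal{N}, \ \forall\, \mathcal{M} \subseteq \mathcal{G} \subseteq \mathcal{N}. \tag{13} $$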

Submodularity means that the marginal throughput gain from adding a user diminishes as the set of
associated users grows. In [32], Tse and Hanly have shown that for a fixed power allocation policy
without the total power constraint, the capacity of a fading multiple access channel is a submodular
function. However, in our case showing the submodularity of the throughput $R_w(\cdot)$ is much more
involved, because our resource allocation is the solution to the underlying optimization problems and
therefore changes with the set of associated users.

Once the submodularity property is shown, we can utilize a result from Vetta [22] to obtain the desired
lower bound. In particular, reference [22] introduces the notion of valid-utility games, for which the
efficiency of any NE is lower bounded by 1/2. We will show that our BS selection game $\mathcal{G}$ belongs to the
family of valid-utility games.

Theorem 3: The weighted system throughput achieved in any NE of the game $\mathcal{G}$ must be at least half
of that achieved under the optimal user-BS assignment.

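Proof: It is easy to check that $R_w(\cdot)$ has a monotonicity property: $R_w(\mathcal{M}) \le R_w(\mathcal{G})$, $\forall\, \mathcal{M} \subseteq \mathcal{G}$. We then
claim that $R_w(\cdot)$ satisfies (13). We only give the proof for the (more difficult) CAPA case; the CA case
is a straightforward extension. For notational simplicity, we let BS $w$ operate on all channels $\mathcal{K}$, set
$n_j^k = 1$ for all $j, k$, and let $\alpha_w = 1$.

Fix two sets $\mathcal{M}, \mathcal{G}$ with $\mathcal{M} \subseteq \mathcal{G}$, and fix an arbitrary user $i$ with arbitrary channel gains. Define three
vectors $\mathbf{g}, \mathbf{m}, \mathbf{h} \in \mathbb{R}^K$, with their elements given as

$$ \mathbf{g}[k] = \max_{j \in \mathcal{G}} |h_j^k|^2, \quad \mathbf{m}[k] = \max_{j \in \mathcal{M}} |h_j^k|^2, \quad \mathbf{h}[k] = |h_i^k|^2. \tag{14} $$

Note that $\mathbf{g}$ and $\mathbf{m}$ represent the best channel gain on each channel for the sets of users $\mathcal{G}$ and $\mathcal{M}$,
respectively. From the fact that $\mathcal{M} \subseteq \mathcal{G}$, we have that $\mathbf{m} \le \mathbf{g}$. Note that the throughput obtained by BS
$w$ using the CAPA strategy depends on the set of associated users only through the best channel
vector. As a result, we can also express $R_w(\mathcal{G})$ as $R_w(\mathbf{g})$, and $R_w(\mathcal{G} \cup \{i\})$ as $R_w(\mathbf{g} \vee \mathbf{h})$, where $\vee$
denotes the element-wise maximum. In this notation, the submodular property (13) is equivalent to

$$ R_w(\mathbf{g} \vee \mathbf{h}) - R_w(\mathbf{g}) \le R_w(\mathbf{m} \vee \mathbf{h}) - R_w(\mathbf{m}), \quad \forall\, \mathbf{h} \ge 0, \ \mathbf{g} \ge \mathbf{m} \ge 0. \tag{15} $$

By the same token, the monotonicity of $R_w(\cdot)$ can be expressed as

$$ R_w(\mathbf{g}) \ge R_w(\mathbf{m}), \quad \forall\, \mathbf{g} \ge \mathbf{m} \ge 0. \tag{16} $$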
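
Conditions (15) and (16) can be spot-checked numerically. The following is a minimal sketch, not the authors' code: it assumes unit noise, a unit power budget, natural logarithms, and the water-filling form $p^k = [\lambda - 1/\mathbf{g}[k]]^+$ used in the Appendix. The function names `waterfill_rate`, `vmax` and `check_conditions` are hypothetical.

```python
import math
import random


def waterfill_rate(gains, power=1.0):
    """Sum of log(1 + g_k * p_k) under water-filling with unit noise."""
    g = sorted((x for x in gains if x > 0), reverse=True)
    # Try the n strongest channels, from all of them down to one, and stop
    # at the first n whose weakest channel still receives positive power.
    for n in range(len(g), 0, -1):
        level = (power + sum(1.0 / x for x in g[:n])) / n  # water level
        if level - 1.0 / g[n - 1] > 0:
            return sum(math.log(level * x) for x in g[:n])
    return 0.0


def vmax(u, v):
    """Element-wise maximum of two gain vectors (the join used in (15))."""
    return [max(a, b) for a, b in zip(u, v)]


def check_conditions(trials=2000, num_channels=4, seed=0, tol=1e-9):
    """Randomly test (15) and (16); returns False on the first violation."""
    rng = random.Random(seed)
    for _ in range(trials):
        m = [rng.uniform(0.05, 3.0) for _ in range(num_channels)]
        # g >= m element-wise, with some coordinates kept equal to m
        g = [x + rng.choice([0.0, rng.uniform(0.0, 2.0)]) for x in m]
        h = [rng.uniform(0.05, 5.0) for _ in range(num_channels)]
        gain_g = waterfill_rate(vmax(g, h)) - waterfill_rate(g)
        gain_m = waterfill_rate(vmax(m, h)) - waterfill_rate(m)
        if gain_g > gain_m + tol:                         # condition (15)
            return False
        if waterfill_rate(g) < waterfill_rate(m) - tol:   # condition (16)
            return False
    return True
```

A random search of this kind only looks for counterexamples; it does not replace the proof, and the base of the logarithm does not affect either inequality.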
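We then present a sufficient condition for (15) which is easier to verify. Let $\mathbf{e}_k$ be a $K \times 1$ unit vector
with its $k$-th element being 1. Write $\mathbf{h} = \sum_{k=1}^{K} \mathbf{e}_k \mathbf{h}[k]$. Then we have

$$ R_w(\mathbf{g} \vee \mathbf{h}) - R_w(\mathbf{g}) = \Big[ R_w\Big(\mathbf{g} \vee \sum_{k=1}^{K} \mathbf{e}_k \mathbf{h}[k]\Big) - R_w\Big(\mathbf{g} \vee \sum_{k=1}^{K-1} \mathbf{e}_k \mathbf{h}[k]\Big) \Big] + \cdots + \big[ R_w(\mathbf{g} \vee \mathbf{e}_1 \mathbf{h}[1]) - R_w(\mathbf{g}) \big], $$

$$ R_w(\mathbf{m} \vee \mathbf{h}) - R_w(\mathbf{m}) = \Big[ R_w\Big(\mathbf{m} \vee \sum_{k=1}^{K} \mathbf{e}_k \mathbf{h}[k]\Big) - R_w\Big(\mathbf{m} \vee \sum_{k=1}^{K-1} \mathbf{e}_k \mathbf{h}[k]\Big) \Big] + \cdots + \big[ R_w(\mathbf{m} \vee \mathbf{e}_1 \mathbf{h}[1]) - R_w(\mathbf{m}) \big]. $$

In order for (15) to be true, it is sufficient that for all $k \in \mathcal{K}$, the following is true

$$ R_w(\mathbf{g} \vee \mathbf{e}_k \mathbf{h}[k]) - R_w(\mathbf{g}) \le R_w(\mathbf{m} \vee \mathbf{e}_k \mathbf{h}[k]) - R_w(\mathbf{m}), \quad \forall\, \mathbf{h} \ge 0, \ \mathbf{g} \ge \mathbf{m} \ge 0. \tag{17} $$

Condition (17) allows us to verify the submodular condition on a channel-by-channel basis. Partition
the set $\mathcal{K}$ into two sets: $\mathcal{Q} = \{k \mid \mathbf{m}[k] = \mathbf{g}[k]\}$ and $\bar{\mathcal{Q}} = \{k \mid \mathbf{m}[k] < \mathbf{g}[k]\}$. We can show that (17) is true
for all $k \in \mathcal{Q}$ and $k \in \bar{\mathcal{Q}}$. The proof for this result is given in the Appendix.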

To this point we have shown that $R_w(\cdot)$ is submodular and monotone. From Proposition 1 we have
that $\sum_{i \in \mathcal{N}} U_i(\mathbf{a}) \le \sum_{w} R_w(\mathbf{a})$. Additionally, the definition of $U_i(\cdot)$ ensures that it is equal to
the difference of the system throughput with and without user $i$. As a result, game $\mathcal{G}$ is a valid
utility game, and we can apply [22, Theorem 3] to deduce that any NE of the game achieves at least
half of the optimal weighted throughput. This completes the proof. ■

We emphasize that all the results derived in Sections III-C and III-D hold true regardless of the
presence of the untruthful users, as long as the BSs implement the taxation for each user as specified 
in ([8]l and Q. This is because the association game G is built upon the assumption that the BSs use 
the VCG mechanism, and that the users are always truthful. 

IV. A Dynamic Mechanism 

In this section we introduce a mechanism that allows the users and the BSs to jointly compute a
NE of the game $\mathcal{G}$, which is a high-quality solution for the joint BS selection and resource allocation
problem. All the results in this section are applicable to both games $\mathcal{G}^{CA}$ and $\mathcal{G}^{CAPA}$. Suppose each
user maintains a length-$M$ memory that operates in a first-in-first-out fashion. Each user's memory is
used to store its best associations in the last $M$ iterations.

We first briefly describe the main steps of the proposed mechanism. It alternates between a BS
optimization step and a user optimization step. When it is the BSs' turn to act, based on the current set
of associated users, each of the BSs optimally allocates the resources in its own cell using the VCG
mechanism (cf. Sections III-A and III-B). From the system perspective, in this step, the tuple $(\beta, \mathbf{p})$
is updated while holding the association $\mathbf{a}$ fixed. When it is the users' turn to act, each of them first
computes its current best BS (in terms of achieved individual utility) according to the current association
profile. It then pushes the best BS into its memory, and randomly samples one BS from its memory
for actual association. As we will see later, in this step, $\mathbf{a}$ is updated while fixing $(\beta, \mathbf{p})$. The sampling
step is the key to establishing the convergence of the proposed mechanism. The proposed mechanism is
detailed in Table VI, where the superscript $(t)$ denotes the iteration number.
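
The alternation described above can be sketched in a few lines. This is an illustrative toy under stated assumptions rather than the authors' implementation: each BS has a single channel and serves only its best associated user, a user's utility is its marginal contribution to its own cell's throughput (a stand-in for the VCG-based utility), and "nearest BS" is replaced by "BS with the strongest gain". All function names are hypothetical.

```python
import math
import random
from collections import deque


def cell_rate(users, w, gain):
    """Toy per-BS throughput: BS w serves only its best associated user."""
    return math.log(1.0 + max((gain[j][w] for j in users), default=0.0))


def utility(i, profile, gain):
    """Marginal contribution of user i to the throughput of its own BS."""
    w = profile[i]
    members = [j for j, b in enumerate(profile) if b == w]
    others = [j for j in members if j != i]
    return cell_rate(members, w, gain) - cell_rate(others, w, gain)


def better_replies(i, profile, gain, num_bs):
    """BSs that give user i strictly higher utility, others held fixed."""
    base = utility(i, profile, gain)
    replies = []
    for w in range(num_bs):
        if w == profile[i]:
            continue
        trial = list(profile)
        trial[i] = w
        if utility(i, trial, gain) > base + 1e-12:
            replies.append(w)
    return replies


def run_mechanism(gain, num_bs, memory_len, max_iters=20000, seed=0):
    """Return (profile, converged) after the memory-based update loop."""
    rng = random.Random(seed)
    n = len(gain)
    # Initialization: strongest-gain BS as a stand-in for the nearest BS.
    profile = [max(range(num_bs), key=lambda w: gain[i][w]) for i in range(n)]
    memory = [deque(maxlen=memory_len) for _ in range(n)]  # FIFO memories
    unchanged = 0
    for _ in range(max_iters):
        nxt = []
        for i in range(n):
            replies = better_replies(i, profile, gain, num_bs)
            best = rng.choice(replies) if replies else profile[i]
            memory[i].appendleft(best)                 # oldest entry drops out
            nxt.append(rng.choice(list(memory[i])))    # uniform sampling
        unchanged = unchanged + 1 if nxt == profile else 0
        profile = nxt
        if unchanged >= memory_len:                    # stable for M iterations
            return profile, True
    return profile, False
```

For example, `run_mechanism([[2.0, 0.1], [1.5, 1.0], [0.2, 0.3]], num_bs=2, memory_len=3)` starts from the profile `[0, 0, 1]`, moves the second user to the other BS, and stops once the profile has stayed fixed for three iterations.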

One benefit of the proposed mechanism is that it allows the users to update at the same time without 
explicit coordination of their update sequences. A possible alternative is a sequential implementation 
that allows a single user to update in each iteration. However, such a scheme is undesirable as it requires
significant coordination efforts among all the users/BSs.

An important feature of the mechanism is that each of its steps can be implemented in a distributed 
fashion. The following two assumptions on the network are needed for such purpose: 
TABLE VI
The Proposed Mechanism

S1) Initialization: Let $t = 0$; let the users choose their nearest BSs.

S2) BS Optimization: Based on the current $\mathbf{a}^{(t)}$, each BS implements a VCG mechanism.

S3) User Optimization: For each user $i \in \mathcal{N}$:

S3-1) Compute the Best BS: Compute $BR_i(\mathbf{a}^{(t)})$; if $BR_i(\mathbf{a}^{(t)}) \ne \emptyset$,
randomly select $w_i^{*(t)} \in BR_i(\mathbf{a}^{(t)})$; otherwise, set $w_i^{*(t)} = \mathbf{a}^{(t)}[i]$.

S3-2) Update Memory: Shift $w_i^{*(t)}$ into the front of the memory;
if $t \ge M$, shift $w_i^{*(t-M)}$ out from the end of the memory.

S3-3) Determine the Next BS Association: Uniformly sample user $i$'s memory;
obtain a BS index as $\mathbf{a}^{(t+1)}[i]$.

S4) Continue: If $\mathbf{a}^{(t+1)} = \mathbf{a}^{(t+1-m)}$ for $m = 1, \ldots, M$, stop.
Otherwise, let $t = t + 1$, go to S2).


1) Local channel information is known by each BS. That is, each BS $w$ has the knowledge of
$\{|h_{i,w}^k|^2\}_{i \in \mathcal{N},\, k \in \mathcal{K}_w}$, but not the channels related to other BSs. Note that in an FDD system, this
information is obtained via user feedback, the truthfulness of which is ensured by implementing
the per-BS VCG mechanism;

2) Each BS has a feedback channel to all the potential users.

Under the above assumptions, the mechanism can be implemented in a distributed fashion. In the BSs' optimization
step, the BSs compute the taxes and perform their per-cell resource allocation (cf. Sections III-A and
III-B). They are not required to have the knowledge of the operational conditions or channel states
related to other BSs. In the users' optimization step, to compute the set $BR_i(\mathbf{a}^{(t)})$, each user $i$ needs
to know $U_i([w, \mathbf{a}_{-i}^{(t)}])$, $\forall\, w$ (cf. (12)). Each of these quantities can be expressed as

$$ U_i([w, \mathbf{a}_{-i}^{(t)}]) = \alpha_w r_i([w, \mathbf{a}_{-i}^{(t)}]) - T_i([w, \mathbf{a}_{-i}^{(t)}]). \tag{18} $$

Both terms in (18) can be computed by BS $w$ and fed back to user $i$. To compute the first term in
(18), BS $w$ solves its resource allocation problem with the set of users $\mathcal{N}_w([w, \mathbf{a}_{-i}^{(t)}])$. To obtain the
second term in (18), BS $w$ solves its per-cell problem with and without user $i$. Again, only local
channel states are needed. It is important to note that to carry out the proposed mechanism, the users
do not need to know the behaviors of their counterparts. They only need the "summary" information
(the tax and the rate estimates) from the BSs.

In practice, the users may only switch to a new BS if it offers significantly higher utility, because
each such switch induces costs such as message passing. Let us use $c_i$ to denote this cost for user
$i$. When switching costs are included in the decision process, in each iteration of the mechanism,
$w^* \in BR_i(\mathbf{a}^{(t)})$ implies $U_i([w^*, \mathbf{a}_{-i}^{(t)}]) > U_i(\mathbf{a}^{(t)}) + c_i$. This modification could reduce the number of
iterations needed for convergence (since the users are now less willing to change association), but could
also reduce the system throughput achieved by the identified NE.
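
The effect of the switching cost on the better-reply set can be illustrated with a short sketch. It assumes the user already holds the utility it would obtain at each BS (for example, from BS feedback); the function name and argument layout are hypothetical.

```python
def better_replies_with_cost(utilities, current_bs, switching_cost=0.0):
    """Return the BSs whose utility beats the current one by more than the cost.

    utilities[w] is the utility the user would obtain at BS w with all other
    users' associations held fixed; current_bs is the user's present BS.
    """
    threshold = utilities[current_bs] + switching_cost
    return [w for w, u in enumerate(utilities)
            if w != current_bs and u > threshold]
```

With utilities `[1.0, 1.3, 2.0]` and the user currently at BS 0, a zero cost gives `[1, 2]`, a cost of 0.5 gives `[2]`, and a cost of 1.5 gives an empty set, so the user stays put.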

The convergence property of the proposed mechanism is provided in the following theorem. The
proof is relegated to the Appendix.

Theorem 4: When choosing $M \ge N$, the BS association mechanism produces a sequence $\{\mathbf{a}^{(t)}\}_{t=1}^{\infty}$
that converges to a NE of game $\mathcal{G}$ with probability 1 (w.p. 1).

V. Discussions and Extensions 

To this point we have assumed that the BSs are interested in maximizing the per-BS throughput. Such 
assumption allows the BSs to have closed-form solution to their optimization problems, and it leads to 
important properties such as submodularity of the throughput functions. Our work can be extended to 
cases where the BSs allocate resources using general utility functions as well. 
First we mention that all the previous properties of the mechanism can be straightforwardly generalized
to the case where each BS $w$ aims to maximize a weighted throughput of the form $\sum_{i \in \mathcal{N}_w(\mathbf{a})} \gamma_i r_i$.
The set of weights $\{\gamma_i > 0\}_{i \in \mathcal{N}}$ can be adjusted adaptively by the BSs over time to ensure fairness
among the users' time-averaged transmission rates (see e.g., [33]).

Consider an alternative case in which BS $w$ is interested in finding the best channel assignment to
achieve proportional fairness (PF). The per-BS problem is then given by [10]

$$ \max \ \sum_{i \in \mathcal{N}_w(\mathbf{a})} \alpha_w \log\big(r_i(\beta, \mathbf{p}, \mathbf{a})\big) \tag{CA-PF} $$

$$ \text{s.t.} \quad \beta^k \in \mathcal{N}_w(\mathbf{a}), \ \forall\, k \in \mathcal{K}_w. $$

This problem generally does not admit a closed-form solution, and the BS needs to perform a numerical
search to obtain the optimal solutions (see [10] for a set of efficient search algorithms). Let us use
$r_i^{PF}(\mathbf{a})$ to denote the resulting transmission rate for user $i$. Each user $i$ in cell $w$ then has the
following utility: $U_i^{PF}(\mathbf{a}) \triangleq \alpha_w \log(r_i^{PF}(\mathbf{a})) - T_i^{PF}(\mathbf{a})$, where
$T_i^{PF}(\mathbf{a}) \triangleq \alpha_w \sum_{j \in \mathcal{N}_w(\mathbf{a}_{-i})} \log(r_j^{PF}(\mathbf{a}_{-i})) - \alpha_w \sum_{j \in \mathcal{N}_w(\mathbf{a}) \setminus i} \log(r_j^{PF}(\mathbf{a}))$.
We can now construct a PF association game $\mathcal{G}^{PF}$ with each user's utility
function given as $U_i^{PF}(\cdot)$. Similarly to Theorem 2, we can show that the optimal association profile
$\mathbf{a}^* = \arg\max_{\mathbf{a}} \sum_{w} \alpha_w \sum_{j \in \mathcal{N}_w(\mathbf{a})} \log(r_j^{PF}(\mathbf{a}))$ must be a NE of this game. Our proposed mechanism
can be applied for finding the NE of this game.

In general, most of the properties of the mechanism (except for the efficiency lower bounds of the
NE) can be extended to networks with the following properties: 1) The BSs operate independently
(using orthogonal time/frequency resources); 2) The utility function chosen by each BS is separable
among the associated users; 3) The optimal solutions of the per-BS optimization problem can be found.
However, it is not clear at this point whether the monotonicity and the submodularity conditions
carry over to the case of general utility functions. Showing such properties under general utility
functions is left as future work.

VI. Simulations 

In this section, we present simulation results to demonstrate the performance of the proposed
algorithm. Both indoor and outdoor network scenarios are considered.

A. An Indoor Network Scenario 

We have the following settings for this part of the simulation. Let us denote a 50 m × 50 m indoor area
as $A$; denote the 25 m × 25 m central area of $A$ as $C$; define the border of $A$ as $B$. Define the parameter
$0 \le D \le 1$ as the distribution factor of the users/BSs: 1) $D \times 100\%$ of the users and BSs are randomly
placed in $A$; 2) the rest of the users are randomly placed in $C$ and the rest of the BSs are randomly placed
on $B$. When $D$ is small, the subset of BSs that are located at the center of the area become hotspots
and are likely to be congested. See Fig. 1 for an illustration. Let $d_{i,w}$ denote the distance between
user $i$ and BS $w$. The channels between user $i$ and BS $w$, $\{h_{i,w}^k\}_{k \in \mathcal{K}_w}$, are generated independently
from the complex Gaussian distribution $\mathcal{CN}(0, \sigma_{i,w}^2)$, with $\sigma_{i,w}^2 = L_{i,w} / PL_{i,w}$. The random variable
$L_{i,w}$ models the shadowing effect, i.e., $10 \log_{10}(L_{i,w}) \sim \mathcal{N}(0, 64)$ is a real Gaussian random variable.
The variable $PL_{i,w}$ is the pathloss between BS $w$ and user $i$. To model the pathloss in the indoor
scenario, the office environment model [34] is used. The key simulation parameters are given in Table
VII. The performance of the proposed algorithm will be compared with the algorithm that first assigns
the users to their nearest BSs, and then optimally performs the per-BS resource allocation. Note that
this algorithm separates the process of association and per-cell resource allocation, hence in most cases
it gives degraded system performance. Throughout this subsection, the CAPA strategy will be adopted
for per-BS resource allocation.

TABLE VII

Simulation Parameters for the Indoor Network.

Parameters                        Values
$p_w$                             23 dBm
Total Bandwidth                   80 MHz
Path Loss (dB)                    $PL_{i,w} = PL(1) + 26 \log_{10}(d_{i,w}) + 14.1$
(parameter name not legible)      1
BER                               (exponent not legible)
Length of Memory                  10
Operating Frequency               1.9 GHz
Noise Power                       -100 dBm/Hz
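
The channel model described above can be sketched as follows. This is a minimal illustration, not the simulation code used for the figures: it assumes the pathloss is supplied in dB, draws one shadowing value per user-BS link, and returns the squared channel magnitudes. The function name and its parameters are hypothetical.

```python
import math
import random


def sample_channel_gains(pathloss_db, num_channels, shadow_std_db=8.0, rng=None):
    """Draw |h|^2 on each channel of one user-BS link.

    The shadowing term satisfies 10*log10(L) ~ N(0, 64), i.e. a standard
    deviation of 8 dB, and each channel coefficient is CN(0, L / PL).
    """
    rng = rng or random.Random()
    shadow_db = rng.gauss(0.0, shadow_std_db)            # one draw per link
    variance = 10 ** ((shadow_db - pathloss_db) / 10.0)  # sigma^2 = L / PL
    std_per_dim = math.sqrt(variance / 2.0)              # real and imaginary parts
    gains = []
    for _ in range(num_channels):
        re = rng.gauss(0.0, std_per_dim)
        im = rng.gauss(0.0, std_per_dim)
        gains.append(re * re + im * im)                  # squared magnitude
    return gains
```

Passing a seeded generator, for example `sample_channel_gains(80.0, 16, rng=random.Random(1))`, makes a run reproducible.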

The first set of experiments evaluates the convergence performance of the proposed mechanism. Fig.
2 plots 3 realizations of the evolution of system throughput. This figure demonstrates the ability of
the algorithm to "track" the equilibrium solutions. The algorithm takes a few iterations to converge to
new equilibria when the following events occur at iteration 100: 1) 10 (randomly placed) new/old users
enter/leave the system; 2) all of the users' channel gains are re-generated (with the locations of the
users and BSs unchanged).

In Fig. 3 we evaluate the averaged convergence time for the algorithm. We highlight its "tracking"
ability by adding a number of new users and by randomly re-generating all the users' channel gains
after an equilibrium has been reached. The algorithm is able to track the equilibrium much faster than
performing a complete restart. In Fig. 4 we plot the averaged convergence time of the algorithm with
N = 30, K = 512. We observe that the convergence time decreases with $D$. This phenomenon is
intuitive because when $D$ is close to 1, congestion is less likely to happen as on average
the communication load is evenly distributed among the BSs. On the contrary, when $D$ becomes small,
those BSs located in the interior of the area are likely to be congested. Large portions of the users will
then seek alternative choices, which results in longer convergence time. Additionally, when taking
the switching costs into consideration, the algorithm converges significantly faster.

The second set of experiments evaluates the throughput performance of the proposed
algorithm. We first investigate a relatively small network with 10 users, 64 channels and 1 to 4 BSs,
and compare the performance of the proposed algorithm to the global optimal solution of the problem
(SYS) (obtained by an exhaustive search). The results are shown in Fig. 5. We see that the proposed
algorithm, abbreviated as Distributed BS Association (DBSA), performs well with little throughput loss.
In contrast, the nearest BS algorithm performs poorly.

We then evaluate the performance of the algorithm in larger networks with 30 users, up to 8 BSs and
512 channels. Fig. 6 shows the comparison of the averaged performance of the proposed algorithm and
the nearest BS algorithm. Due to the prohibitive computation time required, we are unable to obtain
the optimal system throughput in this case. We instead compute a (strict) upper bound of the maximum
throughput assuming that the users can connect to multiple BSs simultaneously. We refer to this as the
multiple-connectivity network. We also observe that when we take the switching costs into consideration
($c_i = 1$ Mbps for all $i$), there is a slight decrease in system throughput.

In Fig. 7 we show the distribution of the per-BS rates achieved by the proposed algorithm and
the nearest BS algorithm. From the figure we see that the proposed algorithm is able to distribute the
throughput to different BSs fairly, while the nearest BS algorithm may result in a severe imbalance of
the BSs' loads (some BSs may experience heavy traffic while the rest of the BSs may become idle).
B. An Outdoor Multicell Cellular Network Scenario

In this section we demonstrate the performance of the proposed algorithm in a multicell OFDMA
cellular network. Standard cellular network parameters are used for the simulation; see Table VIII¹.
Frequency selective channels with a Rayleigh fading component and an 8 dB log-normal fading component
are simulated. Users are assumed to be distributed uniformly in the entire network. Throughout this
subsection, the system level PF objective is optimized, thus the CA-PF strategy discussed in Section
V is used for the per-BS resource allocation. The solution to the problem (CA-PF) is computed using
Algorithm 1 in [10]. The main purpose of this experiment is to evaluate the performance of the proposed
algorithm when some of the key assumptions guaranteeing the theoretical properties of the algorithm
no longer hold true. Note that in order for the proposed algorithm to work in this network setting,
inter-cell interference should be treated as noise. That is, user $i$'s noise power on channel $k$, $n_i^k$, should
include both the environmental noise power and the inter-cell interference power.



TABLE VIII

Simulation Parameters for the Outdoor Network

Parameters                        Values
Cell layout                       Hexagonal, 7 cells, 3 sectors per cell
BS-BS distance                    2.8 km
Frequency Reuse                   1
$p_w$                             49 dBm
Path Loss Model (dB)              $PL_{i,w} = 128.1 + 36.7 \log_{10}(d_{i,w})$
BER                               (exponent not legible)
Total Bandwidth                   10 MHz
Noise Power                       -169 dBm/Hz
Multipath Time Delay Profile      ITU-R M.1225 PedA
Number of channels (FFT size)     64
Length of Individual Memory       M = 10

We first show the convergence of the algorithm. In the considered cellular network, different BSs
transmit using the same spectrum bands. Consequently our theoretical analysis of the convergence is
no longer valid. However, convergence is still observed empirically. See Table IX for the comparison
of the convergence speed with and without the switching costs $\{c_i\}$.

We then demonstrate the throughput performance of the algorithm. We compare the proposed
algorithm with the nearest BS algorithm and the "Greedy-0" algorithm proposed in [35], which is a
centralized algorithm that finds a good user-BS association by successively perturbing the user-BS
association locally. In Table X and Fig. 9 we see that the proposed algorithm compares favorably with
the other algorithms both in terms of system throughput and fairness levels. Each entry in the table is
obtained via an average of 200 randomly generated networks.



TABLE IX

The Averaged Number of Iterations for Convergence

          DBSA    DBSA ($c_i$ = 0.1 Mbps)    DBSA ($c_i$ = 0.5 Mbps)
N=10      55      33                         21
N=30      65      38                         25
N=50      70      40                         23

¹Most of the network parameters are taken from [33]. In the present work only single antenna systems are simulated.

TABLE X

Comparison of the System Throughput of Different Algorithms

          DBSA           DBSA ($c_i$ = 0.1 Mbps)    DBSA ($c_i$ = 0.5 Mbps)    Greedy-0       Nearest
N=20      97.86 Mbps     93.08 Mbps                 90.13 Mbps                 82.23 Mbps     63.31 Mbps
N=40      117.9 Mbps     115.1 Mbps                 109.3 Mbps                 105.7 Mbps     89.0 Mbps
N=60      135.9 Mbps     129.9 Mbps                 125.1 Mbps                 119.5 Mbps     104.8 Mbps

Additionally, we evaluate the performance of the algorithm when only noisy channel information is
available. In particular, we consider the situation in which the normalized channel magnitude estimated
by the users is subject to a zero mean estimation error [36], [37]. Let $\hat{h}_i^k = \bar{h}_i^k + \tilde{h}_i^k$ denote
the estimated normalized channel magnitude by user $i$ on channel $k$, where $\bar{h}_i^k$ is the true normalized
channel magnitude and $\tilde{h}_i^k \sim \mathcal{N}(0, (\sigma_i^k)^2)$ is
the estimation error. Suppose the estimated normalized channels are used by the BSs for resource
allocation. To evaluate the performance loss due to the channel inaccuracy, similarly to [38], we
introduce a term called Channel Error Ratio (CER) to quantify the strength of the channel error:
CERf^ = lOlogio (^ ^i^^l^ y In Fig. [TOl we plot the system throughput performance with different 
values of the CER. We observe that the overall throughput degrades slightly with such inaccurate 
channel information. 

VII. Conclusion 

In this work, we studied a resource management problem in a multi-cell network in the presence of
strategic/selfish users. We proposed a novel mechanism that implements a strategy-proof and approximately
optimal scheme in dominant strategies. Utilizing a key submodularity property of the per-BS
throughput function, we characterized the efficiency of the proposed mechanism. As future work, we
will study the case in which there is limited (low rate) feedback from the users to the BSs. In this case
the feedback strategy needs to be designed in conjunction with the BSs' and the users' strategies. A new
approximation ratio needs to be derived for this more practical scenario. We also plan to generalize the
submodularity property to other utility functions and to the case of MIMO networks. Another interesting 
extension of the current work is to include the users that are hostile instead of non-cooperative. Strategies 
that are different from pricing are needed in this case to counter the untruthfulness induced by the 
hostility. 
IX. Appendix

A. Proof of Theorem 1

Without loss of generality, we consider the case where $\alpha_w = 1$, $n_i^k = 1$, $\forall\, w, k, i$. This claim is
proved based on a polynomial time transformation from the 3-SAT problem, which is a known NP-complete
problem. The 3-SAT problem is described as follows. Given $Q$ disjunctive clauses $C_1, \cdots, C_Q$ defined
on $M$ Boolean variables $X_1, \cdots, X_M$, i.e., $C_q = T_{q1} \vee T_{q2} \vee T_{q3}$ with $T_{qj} \in \{X_1, \cdots, X_M, \bar{X}_1, \cdots, \bar{X}_M\}$,
the question is to check whether there exists a truth assignment for the Boolean variables such that all
clauses are satisfied. Define $\pi(C_q)$ as the set of terms contained in clause $C_q$, and define $\mathcal{I}(T)$ as the
index of a term $T$'s corresponding variable, e.g., if $C_m = X_1 \vee X_2 \vee X_4$, then $\pi(C_m) = \{X_1, X_2, X_4\}$,
and $\mathcal{I}(X_1) = 1$.

Given any instance of the 3-SAT problem with $Q$ clauses and $M$ variables, we construct an instance
of the multi-BS network with $(2Q + 1)M$ users and $2M + Q$ BSs. For each variable $X_m$, we do the
following: 1) associate with it two variable BSs $X_m, \bar{X}_m$; 2) associate with it $(2Q + 1)$ variable users
$\{x_m^1, \cdots, x_m^Q\}$, $\{\bar{x}_m^1, \cdots, \bar{x}_m^Q\}$ and $y_m$; 3) set the individual maximum power of each of the BSs $X_m, \bar{X}_m$ to
be $Q$; 4) let each of the BSs $X_m, \bar{X}_m$ operate on $Q$ channels. For each clause $C_q$ we do
the following: 1) associate with it a single BS $C_q$; 2) set the maximum power of BS $C_q$ to be 1; 3)
let BS $C_q$ operate on a single channel. Denote the $k$-th channel in the set $\mathcal{K}_w$ as $\mathcal{K}_w(k)$. The channel
gains between various users and BSs are given as follows.

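[Equation (19), which specifies case by case the channel gains between the variable users and the BSs, is not
recoverable from the source. The surviving case conditions are: $w = X_m$, $k = \mathcal{K}_w(q)$; $w = C_q$ with
$X_m \in \pi(C_q)$, $\forall\, k \in \mathcal{K}_w$; and a gain of 0 otherwise.]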

For example, for a clause $C_q = X_1 \vee X_2 \vee X_3$, the constructed network is shown in Fig. 11.

In the following, for notational simplicity, for a term $T \in \pi(C_q)$, we will use the shorthand notation
$T$ to denote its corresponding variable BS (instead of $X_{\mathcal{I}(T)}$ or $\bar{X}_{\mathcal{I}(T)}$), and use the notation $y_{\mathcal{I}(T)}$ and
$\{t_{\mathcal{I}(T)}^1, \cdots, t_{\mathcal{I}(T)}^Q\}$ to denote its corresponding variable users. Take an arbitrary clause $C_q$ and a term $T \in
\pi(C_q)$. Two important observations can be made at this point. Both of them are direct consequences
of our network construction, and we omit their proofs due to space limitations.

Observation 1: If at the optimal solution, BS $T$ selects its corresponding variable user $y_{\mathcal{I}(T)}$ for
transmission, then the subset of variable users $\{\bar{t}_{\mathcal{I}(T)}^1, \cdots, \bar{t}_{\mathcal{I}(T)}^Q\}$ must be selected by BS $\bar{T}$ at the
optimal solution. A direct consequence of this observation is that the maximum throughput that the BS
pair $T, \bar{T}$ can achieve is $2Q + Q \log 3$.

Observation 2: Assuming $T \in \pi(C_q)$, the maximum throughput achievable by the BSs $\{T, \bar{T}, C_q\}$
is $2Q + Q \log 3 + 1$. This throughput is achieved if and only if BS $T$ selects user $y_{\mathcal{I}(T)}$, BS $\bar{T}$ selects
users $\{\bar{t}_{\mathcal{I}(T)}^1, \cdots, \bar{t}_{\mathcal{I}(T)}^Q\}$, and BS $C_q$ selects the user $t_{\mathcal{I}(T)}^q$.

Our main claim is that the given 3-SAT instance is satisfiable if and only if the constructed network
achieves a throughput of at least $Q + 2MQ + MQ \log 3$.
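
The size of the constructed network and the throughput threshold in the main claim can be tabulated directly from the counts above. The sketch below is illustrative only; it assumes the logarithm is base 2 (consistent with a rate of 2 per channel for a gain-power product of 3), and the function name is hypothetical.

```python
import math


def reduction_size(num_clauses, num_variables):
    """Sizes of the multi-BS instance built from a 3-SAT instance.

    Assumes log means log base 2 in the throughput expressions.
    """
    q, m = num_clauses, num_variables
    return {
        "users": (2 * q + 1) * m,          # (2Q + 1) variable users per variable
        "base_stations": 2 * m + q,        # two variable BSs each, one per clause
        "throughput_threshold": q + 2 * m * q + m * q * math.log2(3),
    }
```

For instance, `reduction_size(2, 3)` reports 15 users and 8 BSs, so the instance grows polynomially in $Q$ and $M$, as the transformation requires.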

Suppose the 3-SAT problem is satisfiable. Then for each clause $C_q$ there is a term $T^* \in \pi(C_q)$
such that $T^* = 1$. Let the variable BS $T^*$ select user $y_{\mathcal{I}(T^*)}$, let the variable BS $\bar{T}^*$ select all users
$\{\bar{t}_{\mathcal{I}(T^*)}^1, \cdots, \bar{t}_{\mathcal{I}(T^*)}^Q\}$, and let the clause BS $C_q$ select the variable user $t_{\mathcal{I}(T^*)}^q$. By Observation 2, the total
throughput of the variable BSs $T^*, \bar{T}^*$ and the clause BS $C_q$ is $2Q + Q \log 3 + 1$. As a result, the overall
system throughput is $2MQ + QM \log 3 + Q$.

Conversely, suppose a throughput of $2MQ + MQ \log 3 + Q$ is achieved. From Observation 1 we see
that the maximum throughput for a variable BS pair $X_m, \bar{X}_m$ is $2Q + Q \log 3$. As we have $M$ variable
BS pairs and $Q$ clause BSs, each clause BS must achieve throughput 1 in order for the system to
achieve the throughput $M(2Q + Q \log 3) + Q$. Observation 2 implies that this is only possible if for
each clause BS $C_q$, there is at least one variable BS, say $T^* \in \pi(C_q)$, that transmits to the corresponding
variable user $y_{\mathcal{I}(T^*)}$. Set all such terms $T^*$ to 1; then each clause $C_q$ will be satisfied. Consequently, the
given 3-SAT instance is satisfiable.

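B. Proof of Theorem 2

We prove this theorem by contradiction. Suppose $\mathbf{a}^* \in \arg\max_{\mathbf{a}} R(\mathbf{a})$, but $\mathbf{a}^*$ is not a pure NE. Then
there must exist a user $i$ such that $BR_i(\mathbf{a}^*) \ne \emptyset$. Choose $\tilde{w} \in BR_i(\mathbf{a}^*)$, and define a new association
profile $\tilde{\mathbf{a}} = [\tilde{w}, \mathbf{a}_{-i}^*]$. Let $w^* = \mathbf{a}^*[i]$. We show that user $i$'s unilateral change of association has the
same effect on its own utility as on the system throughput:

$$ U_i(\tilde{\mathbf{a}}) - U_i(\mathbf{a}^*) \overset{(a)}{=} \Big( \alpha_{\tilde{w}} \sum_{j \in \mathcal{N}_{\tilde{w}}(\tilde{\mathbf{a}})} r_j(\tilde{\mathbf{a}}) - \alpha_{\tilde{w}} \sum_{j \in \mathcal{N}_{\tilde{w}}(\tilde{\mathbf{a}}_{-i})} r_j(\tilde{\mathbf{a}}_{-i}) \Big) - \Big( \alpha_{w^*} \sum_{j \in \mathcal{N}_{w^*}(\mathbf{a}^*)} r_j(\mathbf{a}^*) - \alpha_{w^*} \sum_{j \in \mathcal{N}_{w^*}(\mathbf{a}_{-i}^*)} r_j(\mathbf{a}_{-i}^*) \Big) $$

$$ \overset{(b)}{=} \Big( \alpha_{\tilde{w}} \sum_{j \in \mathcal{N}_{\tilde{w}}(\tilde{\mathbf{a}})} r_j(\tilde{\mathbf{a}}) + \alpha_{w^*} \sum_{j \in \mathcal{N}_{w^*}(\tilde{\mathbf{a}}_{-i})} r_j(\tilde{\mathbf{a}}_{-i}) \Big) - \Big( \alpha_{w^*} \sum_{j \in \mathcal{N}_{w^*}(\mathbf{a}^*)} r_j(\mathbf{a}^*) + \alpha_{\tilde{w}} \sum_{j \in \mathcal{N}_{\tilde{w}}(\mathbf{a}_{-i}^*)} r_j(\mathbf{a}_{-i}^*) \Big) $$

$$ \overset{(c)}{=} \sum_{w} \alpha_w \big( R_w(\tilde{\mathbf{a}}) - R_w(\mathbf{a}^*) \big) = R(\tilde{\mathbf{a}}) - R(\mathbf{a}^*) \tag{20} $$

where (a) is from the definition of the utility function; (b) is due to the fact that $\tilde{\mathbf{a}}_{-i} = \mathbf{a}_{-i}^*$; (c)
is due to $\mathcal{N}_{w^*}(\tilde{\mathbf{a}}_{-i}) = \mathcal{N}_{w^*}(\tilde{\mathbf{a}})$, $\mathcal{N}_{\tilde{w}}(\mathbf{a}_{-i}^*) = \mathcal{N}_{\tilde{w}}(\mathbf{a}^*)$, and $R_w(\tilde{\mathbf{a}}) = R_w(\mathbf{a}^*)$, $\forall\, w \ne \tilde{w}, w^*$. By
assumption, user $i$ prefers to switch to BS $\tilde{w}$, so $U_i(\tilde{\mathbf{a}}) > U_i(\mathbf{a}^*)$. This combined with (20) yields
$R(\tilde{\mathbf{a}}) > R(\mathbf{a}^*)$, which contradicts the optimality of $\mathbf{a}^*$. Hence $\mathbf{a}^* \in \arg\max_{\mathbf{a}} R(\mathbf{a})$ is
a NE for game $\mathcal{G}$. ■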

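C. Proof of Theorem 3

We show that (17) is true for all $k \in \mathcal{Q}$ and $k \in \bar{\mathcal{Q}}$.

Step 1) We first argue that for all $k \in \mathcal{Q}$, (17) is true. When $\mathbf{h}[k] \le \mathbf{m}[k] = \mathbf{g}[k]$, (17) is trivially
true as both sides of it evaluate to zero. We then focus on the case $\mathbf{h}[k] > \mathbf{m}[k] = \mathbf{g}[k]$.
Assuming $\mathbf{h}[k] > \mathbf{m}[k] = \mathbf{g}[k]$, we have that

$$ \mathbf{g} \vee \mathbf{e}_k \mathbf{h}[k] = \mathbf{g} + c \times \mathbf{e}_k, \qquad \mathbf{m} \vee \mathbf{e}_k \mathbf{h}[k] = \mathbf{m} + c \times \mathbf{e}_k $$

for some constant $c > 0$. Thus, for all $k \in \mathcal{Q}$ with $\mathbf{h}[k] > \mathbf{m}[k] = \mathbf{g}[k]$, to show the inequality (17), it
suffices to show the following decreasing difference property

$$ R_w(\mathbf{g} + \delta \times \mathbf{e}_k) - R_w(\mathbf{g}) \le R_w(\mathbf{m} + \delta \times \mathbf{e}_k) - R_w(\mathbf{m}), \quad \forall\, \delta > 0, \text{ and } \mathbf{g} \ge \mathbf{m} \ge 0. \tag{21} $$

From [39], we know that whenever the function $R_w(\mathbf{x})$ is differentiable with respect to $\mathbf{x}[k]$, the
decreasing difference property (21) is equivalent to the following property

$$ \lim_{\delta \downarrow 0} \frac{R_w(\mathbf{g} + \delta \times \mathbf{e}_k) - R_w(\mathbf{g})}{\delta} \le \lim_{\delta \downarrow 0} \frac{R_w(\mathbf{m} + \delta \times \mathbf{e}_k) - R_w(\mathbf{m})}{\delta}, \quad \forall\, \mathbf{g} \ge \mathbf{m} \ge 0. \tag{22} $$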

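In what follows, we prove that for any $k$ with $\mathbf{h}[k] > \mathbf{m}[k] = \mathbf{g}[k]$, the limits in (22) exist and that
their difference (left-hand side minus right-hand side) is non-positive. To this end, a closer look at the
function $R_w(\cdot)$ is necessary. Let $p_{\mathbf{g}}^k$ denote the power
allocation for channel $k$ when the best channel gain vector is $\mathbf{g}$, and let $\lambda_{\mathbf{g}}$ denote the corresponding
dual variable. From the CAPA strategy, we have that $p_{\mathbf{g}}^k = [\lambda_{\mathbf{g}} - \frac{1}{\mathbf{g}[k]}]^+$. Define the active channel set
as $\mathcal{K}_{\mathbf{g}} = \{k \mid \lambda_{\mathbf{g}} - \frac{1}{\mathbf{g}[k]} > 0\}$. From the fact that the power constraint must be active for the CAPA
strategy, we have that $\sum_{k=1}^{K} p_{\mathbf{g}}^k = \sum_{k \in \mathcal{K}_{\mathbf{g}}} \big(\lambda_{\mathbf{g}} - \frac{1}{\mathbf{g}[k]}\big) = p_w$, which implies that

$$ \lambda_{\mathbf{g}} = \frac{p_w + \sum_{k \in \mathcal{K}_{\mathbf{g}}} \frac{1}{\mathbf{g}[k]}}{|\mathcal{K}_{\mathbf{g}}|}. \tag{23} $$

Using this expression as well as the expression for $p_{\mathbf{g}}^k$, we have that

$$ R_w(\mathbf{g}) = |\mathcal{K}_{\mathbf{g}}| \log\Big( \frac{p_w + \sum_{k \in \mathcal{K}_{\mathbf{g}}} \frac{1}{\mathbf{g}[k]}}{|\mathcal{K}_{\mathbf{g}}|} \Big) + \sum_{k \in \mathcal{K}_{\mathbf{g}}} \log(\mathbf{g}[k]) = |\mathcal{K}_{\mathbf{g}}| \log(\lambda_{\mathbf{g}}) + \sum_{k \in \mathcal{K}_{\mathbf{g}}} \log(\mathbf{g}[k]). \tag{24} $$

We argue that when $\mathbf{m} \le \mathbf{g}$, we must have $\lambda_{\mathbf{g}} \le \lambda_{\mathbf{m}}$. Otherwise, if $\lambda_{\mathbf{g}} > \lambda_{\mathbf{m}}$, due to the fact that
$\frac{1}{\mathbf{g}[k]} \le \frac{1}{\mathbf{m}[k]}$, we must have $p_{\mathbf{g}}^k > p_{\mathbf{m}}^k$, $\forall\, k \in \mathcal{K}_{\mathbf{m}}$, which implies $\sum_{k \in \mathcal{K}_{\mathbf{m}}} p_{\mathbf{g}}^k > \sum_{k \in \mathcal{K}_{\mathbf{m}}} p_{\mathbf{m}}^k = p_w$, a violation
of the total power constraint.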

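Take any channel $k^*$ with $\mathbf{h}[k^*] > \mathbf{m}[k^*] = \mathbf{g}[k^*]$, and define the best channel gains after the increase
on channel $k^*$ as $\mathbf{m}^* = \mathbf{m} + \mathbf{e}_{k^*} \times \delta$ and $\mathbf{g}^* = \mathbf{g} + \mathbf{e}_{k^*} \times \delta$, respectively. Let $\mathcal{K}_{\mathbf{m}^*}$ and $\mathcal{K}_{\mathbf{g}^*}$ denote the
corresponding sets of active channels. Comparing $\mathcal{K}_{\mathbf{m}}$ and $\mathcal{K}_{\mathbf{m}^*}$, we have the following four cases: m1) there exists
an $\epsilon > 0$ such that for all $0 < \delta < \epsilon$, $\mathcal{K}_{\mathbf{m}} = \mathcal{K}_{\mathbf{m}^*}$; m2) for all $\delta > 0$, $\mathcal{K}_{\mathbf{m}} \supset \mathcal{K}_{\mathbf{m}^*}$; m3) for all $\delta > 0$,
$\mathcal{K}_{\mathbf{m}} \subset \mathcal{K}_{\mathbf{m}^*}$; m4) for all $\delta > 0$, $\mathcal{K}_{\mathbf{m}} \ne \mathcal{K}_{\mathbf{m}^*}$. Similarly, we have four cases g1)-g4) comparing the sets
$\mathcal{K}_{\mathbf{g}}$ and $\mathcal{K}_{\mathbf{g}^*}$. In the following we give the expression for $\lim_{\delta \to 0} \frac{R_w(\mathbf{m}^*) - R_w(\mathbf{m})}{\delta}$ for each of the
cases m1)-m4).

We first consider case m1). In the neighborhood $0 < \delta < \epsilon$, $R_w(\mathbf{m}^*)$ can be expressed as

$$ R_w(\mathbf{m}^*) = |\mathcal{K}_{\mathbf{m}}| \log\Big( \frac{p_w + \sum_{k \in \mathcal{K}_{\mathbf{m}}, k \ne k^*} \frac{1}{\mathbf{m}[k]} + \frac{1}{\mathbf{m}[k^*] + \delta}}{|\mathcal{K}_{\mathbf{m}}|} \Big) + \sum_{k \in \mathcal{K}_{\mathbf{m}}, k \ne k^*} \log(\mathbf{m}[k]) + \log(\mathbf{m}[k^*] + \delta). $$

Consequently, in the neighborhood $0 < \delta < \epsilon$, we have

$$ \lim_{\delta \to 0^+} \frac{R_w(\mathbf{m}^*) - R_w(\mathbf{m})}{\delta} = -\frac{|\mathcal{K}_{\mathbf{m}}|}{p_w + \sum_{k \in \mathcal{K}_{\mathbf{m}}} \frac{1}{\mathbf{m}[k]}} \cdot \frac{1}{(\mathbf{m}[k^*])^2} + \frac{1}{\mathbf{m}[k^*]} = -\frac{1}{\lambda_{\mathbf{m}} (\mathbf{m}[k^*])^2} + \frac{1}{\mathbf{m}[k^*]}. $$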
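
The closed-form slope for case m1) can be compared against a finite-difference estimate. The sketch below is a hedged illustration, assuming unit noise, natural logarithms, strictly positive gains, and that channel $k^*$ is active before and after the perturbation; the helper names are hypothetical.

```python
import math


def water_level_and_rate(gains, power=1.0):
    """Return (lambda, R) for water-filling over strictly positive gains."""
    order = sorted(gains, reverse=True)
    # Keep the n strongest channels, choosing the largest n for which the
    # weakest kept channel still receives positive power.
    for n in range(len(order), 0, -1):
        level = (power + sum(1.0 / x for x in order[:n])) / n
        if level - 1.0 / order[n - 1] > 0:
            return level, sum(math.log(level * x) for x in order[:n])
    raise ValueError("need at least one positive gain and positive power")


def closed_form_slope(gains, k, power=1.0):
    """-1 / (lambda * m[k]^2) + 1 / m[k], valid when channel k is active."""
    level, _ = water_level_and_rate(gains, power)
    return 1.0 / gains[k] - 1.0 / (level * gains[k] ** 2)


def finite_difference_slope(gains, k, power=1.0, delta=1e-6):
    """One-sided numerical estimate of the same derivative."""
    bumped = list(gains)
    bumped[k] += delta
    after = water_level_and_rate(bumped, power)[1]
    before = water_level_and_rate(gains, power)[1]
    return (after - before) / delta
```

With gains `[2.0, 1.0, 0.5]` and unit power, only the two strongest channels are active, the water level is 1.25, and both functions give a slope of about 0.3 for the strongest channel.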

We then consider case ml). This case is shown by 4 steps. 

(m2-l): We first show A;* € /Cm- If on the contrary. Am — ^^j'^.j < 0, then due to the continuity 
of Pm with respect to m, there must exist an e > such that for all < 5 < e, Pm* < 0' which 
is equivalent to Am* — yn* [ k*] ^ ^- "^^^^ impUes /Cm = /Cm* for < < e, which contradicts the 
assumption that /Cm 15 /Cm* , V 5 > 0. 

Sfep (m2-2) We then argue that /c* G /Cm*- Assume the contrary, then Am* — in'\k*] ^ ^ • Froi^ the 
previous step, we see that Am — ^^7|pj ^ 0. Then it must be the case that Am > Am* - Due to the fact that 

for all other channels k ^ k* , m[k] = m*[A;], then we must have pw = '}2ik=i Pm > ^k=i Pm* = Pw^ 
a contradiction. 

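Step (m2-3): We have argued that $k^*$ must remain in the active set. Then for $\epsilon$ small enough, there must exist a single channel $\bar{k} \neq k^*$ such that $\bar{k} \in \mathcal{K}_m$ but $\bar{k} \notin \mathcal{K}_{m^*}$, for all $0 < \delta < \epsilon$.¹ The dual variables $\lambda_m$ and $\lambda_{m^*}$ can be expressed as

$$\lambda_m = \frac{1}{|\mathcal{K}_m|}\left(p_w + \sum_{k \in \mathcal{K}_m} \frac{1}{m[k]}\right), \qquad \lambda_{m^*} = \frac{1}{|\mathcal{K}_m| - 1}\left(p_w + \sum_{k \in \mathcal{K}_{m^*} \setminus k^*} \frac{1}{m[k]} + \frac{1}{m[k^*]+\delta}\right). \qquad (26)$$

The difference between the above two dual variables is given by

$$0 \overset{(a)}{\le} \lambda_m - \lambda_{m^*} = \frac{-\lambda_m + \frac{1}{m[\bar{k}]} + \frac{1}{m[k^*]} - \frac{1}{m[k^*]+\delta}}{|\mathcal{K}_m| - 1} \qquad (27)$$

where (a) is from the fact that $m^* \ge m$, and uses the same argument as in the paragraph following (24). Note that $\bar{k} \in \mathcal{K}_m$, so $\lambda_m - \frac{1}{m[\bar{k}]} \ge 0$. Combining this with (27), we have that $\frac{1}{m[k^*]} - \frac{1}{m[k^*]+\delta} \ge \lambda_m - \frac{1}{m[\bar{k}]} \ge 0$ for arbitrarily small $\delta > 0$. Then it must be true that $\lambda_m - \frac{1}{m[\bar{k}]} = 0$.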
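The step from (26) to (27) can be spelled out as follows. This is only a sketch, and it assumes (as in Step (m2-3)) that exactly one channel, written $\bar{k}$ here, leaves the active set, so that $\mathcal{K}_{m^*} = \mathcal{K}_m \setminus \bar{k}$ and $|\mathcal{K}_{m^*}| = |\mathcal{K}_m| - 1$:

```latex
\begin{align*}
|\mathcal{K}_m|\,\lambda_m
  &= p_w + \sum_{k\in\mathcal{K}_m}\frac{1}{m[k]},\\
(|\mathcal{K}_m|-1)\,\lambda_{m^*}
  &= p_w + \sum_{k\in\mathcal{K}_m}\frac{1}{m[k]}
     - \frac{1}{m[\bar{k}]} - \frac{1}{m[k^*]} + \frac{1}{m[k^*]+\delta}\\
  &= |\mathcal{K}_m|\,\lambda_m
     - \frac{1}{m[\bar{k}]} - \frac{1}{m[k^*]} + \frac{1}{m[k^*]+\delta},\\
\lambda_m-\lambda_{m^*}
  &= \frac{1}{|\mathcal{K}_m|-1}
     \left(-\lambda_m + \frac{1}{m[\bar{k}]} + \frac{1}{m[k^*]}
           - \frac{1}{m[k^*]+\delta}\right).
\end{align*}
```

The last line follows by subtracting the third line from $(|\mathcal{K}_m|-1)\,\lambda_m$ and dividing by $|\mathcal{K}_m|-1$.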



¹ If, for all $\delta > 0$, multiple channels leave $\mathcal{K}_m$, then they must have the same magnitude, which is a probability-0 event. Our argument can also be applied to this degenerate case, but for the sake of notational simplicity, we only present the case of a single such channel.

Step (m2-4): Using the result obtained in Step (m2-3) and the rate expression (24), we can express the difference of the rates $R_w(m^*)$ and $R_w(m)$ as

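$$\begin{aligned}
R_w(m^*) - R_w(m) &= (|\mathcal{K}_m| - 1)\log\frac{\lambda_{m^*}}{\lambda_m} - \log\left(\lambda_m\, m[\bar{k}]\right) + \log\frac{m[k^*]+\delta}{m[k^*]} \\
&\overset{(a)}{=} (|\mathcal{K}_m| - 1)\log\left(1 - \frac{\lambda_m - \lambda_{m^*}}{\lambda_m}\right) + \log\left(1 + \frac{\delta}{m[k^*]}\right) \\
&\overset{(b)}{=} (|\mathcal{K}_m| - 1)\log\left(1 - \frac{m[\bar{k}]}{|\mathcal{K}_m| - 1}\left(\frac{1}{m[k^*]} - \frac{1}{m[k^*]+\delta}\right)\right) + \log\left(1 + \frac{\delta}{m[k^*]}\right) \qquad (28)
\end{aligned}$$

where in (a), (b) we have used the fact that $\lambda_m = \frac{1}{m[\bar{k}]}$. Using L'Hôpital's rule, we obtain

$$\begin{aligned}
\lim_{\delta \to 0^+} \frac{R_w(m^*) - R_w(m)}{\delta} &= \lim_{\delta \to 0^+} \left[ -\frac{\frac{m[\bar{k}]}{(m[k^*]+\delta)^2}}{1 - \frac{m[\bar{k}]}{|\mathcal{K}_m| - 1}\left(\frac{1}{m[k^*]} - \frac{1}{m[k^*]+\delta}\right)} + \frac{1}{m[k^*]+\delta} \right] \\
&= -\frac{m[\bar{k}]}{(m[k^*])^2} + \frac{1}{m[k^*]} = -\frac{1}{\lambda_m (m[k^*])^2} + \frac{1}{m[k^*]}. \qquad (29)
\end{aligned}$$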

For the cases m3)-m4), the derivation is similar to that of cases m1)-m2). The key observation is still that the channel $k^*$ must satisfy $k^* \in \mathcal{K}_m$ and $k^* \in \mathcal{K}_{m^*}$, and that the channel $\bar{k}$ that leaves or joins the set $\mathcal{K}_{m^*}$ must satisfy $\lambda_m = \frac{1}{m[\bar{k}]}$. For these cases, (29) again holds true.
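The closed form in (29) can be checked numerically. The sketch below assumes that the rate expression (24) is the standard water-filling rate $\sum_{k \in \mathcal{K}_m} \log(\lambda_m m[k])$ in nats, with all gains strictly positive; the function names (`waterfill`, `slope_numeric`, `slope_closed_form`) and the example gains are hypothetical and not taken from the paper.

```python
import math


def waterfill(m, p_w):
    """Water-filling over channels with gains m[k] > 0 and power budget p_w > 0.

    Returns (lam, active, rate): the water level, the active channel set, and
    the rate sum_{k in active} log(lam * m[k]) in nats.
    """
    order = sorted(range(len(m)), key=lambda k: 1.0 / m[k])  # best channels first
    for n in range(len(m), 0, -1):
        active = order[:n]
        lam = (p_w + sum(1.0 / m[k] for k in active)) / n
        # Accept the largest set in which the weakest channel still gets power.
        if lam > 1.0 / m[active[-1]]:
            rate = sum(math.log(lam * m[k]) for k in active)
            return lam, set(active), rate
    raise ValueError("p_w must be positive")


def slope_numeric(m, p_w, k, delta=1e-6):
    """Forward difference (R(m + delta*e_k) - R(m)) / delta."""
    bumped = list(m)
    bumped[k] += delta
    return (waterfill(bumped, p_w)[2] - waterfill(m, p_w)[2]) / delta


def slope_closed_form(m, p_w, k):
    """-1 / (lam * m[k]^2) + 1 / m[k] for an active channel, 0 otherwise."""
    lam, active, _ = waterfill(m, p_w)
    if k not in active:
        return 0.0
    return -1.0 / (lam * m[k] ** 2) + 1.0 / m[k]


if __name__ == "__main__":
    gains, budget = [2.0, 1.0, 0.5], 1.0
    print(waterfill(gains, budget)[:2])
    print(slope_numeric(gains, budget, 0), slope_closed_form(gains, budget, 0))
```

With these example gains the weakest channel is switched off, and the finite-difference slope for channel 0 agrees with the closed form to within the step size.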

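Fix $\delta < 0$, and redo the above analysis by switching the roles of $m$ and $m^*$ for all four possible cases; we obtain $\lim_{\delta \to 0^-} \frac{R_w(m^*) - R_w(m)}{\delta} = -\frac{1}{\lambda_m}\frac{1}{(m[k^*])^2} + \frac{1}{m[k^*]}$. Consequently, for all $k^*$ that satisfy $h[k^*] > m[k^*] = g[k^*]$, the following is true

$$\lim_{\delta \to 0} \frac{R_w(m^*) - R_w(m)}{\delta} = -\frac{1}{\lambda_m}\frac{1}{(m[k^*])^2} + \frac{1}{m[k^*]}. \qquad (30)$$

For cases g1)-g4), the exact same argument leads to the same result. In summary, we obtain

$$\lim_{\delta \to 0} \frac{R_w(g^*) - R_w(g)}{\delta} = -\frac{1}{\lambda_g}\frac{1}{(g[k^*])^2} + \frac{1}{g[k^*]}, \qquad \lim_{\delta \to 0} \frac{R_w(m^*) - R_w(m)}{\delta} = -\frac{1}{\lambda_m}\frac{1}{(m[k^*])^2} + \frac{1}{m[k^*]}. \qquad (31)$$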

Recall that $k^* \in \mathcal{Q}$, which means that $m[k^*] = g[k^*]$. Using the fact that $g \ge m$, and $\lambda_g \le \lambda_m$, we conclude that (22) is true for all $k$ with $h[k] > m[k] = g[k]$.

Step 2) We then argue that for any channel $k \in \mathcal{Q}$, (ITtI) must be true. For any $g \ge m \ge 0$, pick $k \in \mathcal{Q}$; we have the following three cases: 1) $h[k] \le m[k]$; 2) $m[k] < h[k] \le g[k]$; 3) $h[k] > g[k]$.

Verifying cases 1) and 2) is straightforward. For case 3) we have

$$\begin{aligned}
R_w(m \vee e_k h[k]) - R_w(m) &= R_w(m \vee e_k h[k]) - R_w(m \vee e_k g[k]) + R_w(m \vee e_k g[k]) - R_w(m) \\
&\ge R_w(m \vee e_k h[k]) - R_w(m \vee e_k g[k]) \qquad (32)
\end{aligned}$$

where the inequality is due to the monotonicity property. It is sufficient to show

$$R_w(g \vee e_k h[k]) - R_w(g) \le R_w(m \vee e_k h[k]) - R_w(m \vee e_k g[k]), \quad \forall\, h \ge 0,\ g \ge m \ge 0. \qquad (33)$$

Let $\tilde{m} = m \vee e_k g[k]$ and $h = g + \delta_k e_k$, for some $\delta_k > 0$. Clearly $\tilde{m}[k] = g[k]$. Then to show (33), it is sufficient to show that for all $k$ such that $\tilde{m}[k] = g[k]$, we have

$$R_w(g + e_k \delta_k) - R_w(g) \le R_w(\tilde{m} + e_k \delta_k) - R_w(\tilde{m}), \quad \forall\, \delta \ge 0,\ g \ge \tilde{m} \ge 0 \qquad (34)$$

which reduces to the case in Step 1) (cf. condition (21)). We also have that ([TT]) is true. Combining with our argument in Step 1), we conclude that (fTTT i is true for all $k \in \mathcal{K}$.
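The diminishing-returns inequality established in Steps 1) and 2) can be spot-checked on random instances. The sketch below is an illustration only: it assumes the standard water-filling rate in nats with strictly positive gains, and the helper names (`rate`, `join_k`, `gain`, `check`) and sampling ranges are hypothetical. It tests $R_w(g \vee e_k h[k]) - R_w(g) \le R_w(m \vee e_k h[k]) - R_w(m)$ for randomly drawn $g \ge m$.

```python
import math
import random


def rate(m, p_w):
    """Water-filling rate (nats) for gains m[k] > 0 and power budget p_w > 0."""
    inv = sorted(1.0 / x for x in m)  # inverse gains, best channel first
    for n in range(len(inv), 0, -1):
        lam = (p_w + sum(inv[:n])) / n  # water level for the n best channels
        if lam > inv[n - 1]:  # weakest of the n channels still gets power
            return sum(math.log(lam / v) for v in inv[:n])
    raise ValueError("p_w must be positive")


def join_k(m, k, value):
    """Return m v (e_k * value): entry k is raised to at least `value`."""
    out = list(m)
    out[k] = max(out[k], value)
    return out


def gain(m, k, hk, p_w):
    """Rate improvement from raising channel k of profile m to h[k]."""
    return rate(join_k(m, k, hk), p_w) - rate(m, p_w)


def check(trials=2000, n_ch=5, p_w=1.0, seed=0, tol=1e-9):
    """True if no sampled (m, g, h, k) violates the inequality beyond `tol`."""
    rng = random.Random(seed)
    for _ in range(trials):
        m = [rng.uniform(0.1, 3.0) for _ in range(n_ch)]
        # g >= m componentwise; some entries are left equal to m on purpose.
        g = [x + rng.choice([0.0, rng.uniform(0.0, 2.0)]) for x in m]
        h = [rng.uniform(0.1, 6.0) for _ in range(n_ch)]
        for k in range(n_ch):
            if gain(g, k, h[k], p_w) > gain(m, k, h[k], p_w) + tol:
                return False
    return True


if __name__ == "__main__":
    print(check())
```

A random check of this kind cannot replace the proof; it only guards against sign or index slips when the inequality is reused elsewhere.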
D. Proof of Theorem |?]

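Let $c^{(t)}$ denote the better-reply association at time $t$: $c^{(t)}[i] = w_i^{*(t)}$. Define two sets $\mathcal{C}$ and $\mathcal{A}$: $c \in \mathcal{C} \iff c$ appears infinitely often (i.o.) in $\{c^{(t)}\}_{t=1}^{\infty}$, and $a \in \mathcal{A} \iff a$ appears i.o. in $\{a^{(t)}\}_{t=1}^{\infty}$.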

The first claim is that there exists an $a^* \in \mathcal{A}$ that is a pure NE for game $\mathcal{G}$. Observe that $|\mathcal{A}| > 0$ and $|\mathcal{C}| > 0$ due to the finiteness of the possible association profiles. Suppose $|\mathcal{A}| = 1$; then the single element in $\mathcal{A}$, say $a^*$, must be a NE. Suppose $|\mathcal{A}| > 1$, and choose $c^* \in \mathcal{C}$. Pick a time $t$ such that $c^{(t)} = c^*$. Note that $c^*[i]$ is in the front of the memory for each user $i$; then with probability at least $(1/M)^N$, $a^{(t+1)} = c^*$. This implies $c^* \in \mathcal{A}$. If $c^*$ is a NE, then our claim is proved. If $c^*$ is not a NE, we will show that with positive probability, we can construct a finite sequence that leads to a NE. To this end, consider the following steps of operation.

Step 1): With probability at least $(1/M)^N$, $a^{(t+1)} = c^*$. Because $c^*$ is not a NE, there exists an $i \in \mathcal{N}$ such that $c^{(t+1)}[i] \neq c^{(t)}[i]$. Similarly as in the proof of Theorem 3, we can show that $R(a^{(t+1)}) < R\big([c^{(t+1)}[i], a_{-i}^{(t+1)}]\big)$. With probability at least $(1/M)^N$, every user $j \neq i$ samples $c^{(t)}[j]$, which is now at the second slot in the memory, while user $i$ samples $c^{(t+1)}[i]$. This event leads to $a^{(t+2)} = [c^{(t+1)}[i], a_{-i}^{(t+1)}]$, and we have $R(a^{(t+2)}) > R(a^{(t+1)})$. Put index $i$ into a set $\mathcal{U}$: $\mathcal{U} = \{i\}$. Note that in this stage, we have $a^{(t+2)}[i] = c^{(t+1)}[i]$. Continue this process until we reach a time $t + n \le t + N$ such that only users in the set $\mathcal{U}$ are willing to switch, i.e., $\forall\, j \in \mathcal{L}$, $c^{(T)}[j] = a^{(T)}[j]$. Note that the requirement $M \ge N$ ensures that for all $i$, the set of best responses $\{c^{(\tau)}[i]\}_{\tau=t}^{T}$ is still in user $i$'s memory. Let $T = t + n$. Let $\mathcal{L} = \mathcal{N} \setminus \mathcal{U}$.

Step 2): Observe that for all $i \in \mathcal{U}$, there must exist a constant $k_i$ such that $0 < k_i \le n \le N$ and that its current association $a^{(T)}[i]$ is sampled from its $k_i$-th memory, i.e., $c^{(T-k_i)}[i] = a^{(T)}[i]$. Pick $q \in \mathcal{U}$ that has the largest $k_i$ and is willing to switch at time $T$: $q \in \arg\max_{j:\, c^{(T)}[j] \neq a^{(T)}[j]} k_j$. We can now shift $c^{(T-k_q)}$ out of the memory and still be able to construct $a^{(T+1)} = [c^{(T)}[q], a_{-q}^{(T)}]$ with positive probability, because all the elements in $a_{-q}^{(T)}$ must have appeared once in $\{c^{(t)}\}_{t=T-k_q+1}^{T}$. Move $q$ out of $\mathcal{U}$ and into $\mathcal{L}$, let $T = T + 1$, and continue Step 2) until only users in the set $\mathcal{L}$ are willing to switch. Change the roles of $\mathcal{U}$ and $\mathcal{L}$, and continue Step 2).

Repeating Step 2), we construct a sequence $\{R(a^{(t+n)})\}$ that is strictly increasing. Due to the finiteness of the choice of $a$, there must exist a finite time instance $T^*$ after which it is not possible to find an association that differs from $a^{(t+T^*)}$ in a single element and still has strictly better system throughput. Consequently, $a^* = a^{(t+T^*)}$ is an equilibrium profile. Thus, with positive probability, a NE profile $a^*$ appears after $a^{(t+1)}$ in finite steps. Because $a^{(t+1)} = c^*$, and $c^*$ happens i.o., we must also have $a^*$ i.o., that is, $a^* \in \mathcal{A}$. The claim is proved.

The next claim is that the algorithm converges to $a^*$ with probability 1. Let $\{t_k\}_{k=1}^{\infty}$ denote the subsequence of $\{t\}$ in which $a^*$ happens. Define the event $C_k = \bigcap_{t=1}^{M} \{a^{(t_k+t)} = a^*\}$, that is, starting from a time $t_k$, $a^*$ appears $M+1$ times consecutively. When $C_k$ happens, we have: 1) at time $t_k + M + 1$, $a^{(t_k+M+1)} = a^*$ because $BR_i(a^*) = a^*[i]\ \forall\, i$; 2) $a^{(t_k+M+t)} = a^*$ for all $t > 1$ because after time $(t_k + M + 1)$, each user $i$'s memory will solely consist of $a^*[i]$. Note that if $a^{(t_k)} = a^*$ appears, with probability at least $(1/M)^N$, $a^{(t_k+1)} = a^*$. This implies $\Pr(C_k) \ge (1/M)^{NM}$. Let $C_k^c$ denote the complement set of $C_k$. We have:

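$$\Pr\Big(\bigcap_{k \ge 1} C_k^c\Big) = \lim_{T \to \infty} \Pr\Big(\bigcap_{k=1}^{T} C_k^c\Big) \le \lim_{T \to \infty} \prod_{k=1}^{T} \Big(1 - \Big(\frac{1}{M}\Big)^{NM}\Big) = 0.$$

This says $\Pr(a^{(t)}$ converges to some $a^* \in \mathcal{A}$ eventually$) = 1$. ■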
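The sampled-memory dynamics analysed in this proof can be illustrated on a toy example. The sketch below is not the paper's throughput model: it assumes a hypothetical equal-share congestion game (each BS splits a fixed capacity evenly among its users), chosen only because it is known to admit a pure NE. Each user pushes its best reply to the front of a length-$M$ memory and then samples its next association uniformly from that memory. All names (`payoff`, `best_reply`, `is_nash`, `run`) are illustrative.

```python
import random
from collections import deque


def payoff(profile, i, w, capacity):
    """Rate user i gets at BS w if the other users keep `profile`."""
    load = 1 + sum(1 for j, a in enumerate(profile) if j != i and a == w)
    return capacity[w] / load


def best_reply(profile, i, capacity):
    """Best BS for user i; ties are broken in favour of the current one."""
    best, best_val = profile[i], payoff(profile, i, profile[i], capacity)
    for w in range(len(capacity)):
        val = payoff(profile, i, w, capacity)
        if val > best_val + 1e-12:
            best, best_val = w, val
    return best


def is_nash(profile, capacity):
    """True if no user can strictly improve by a unilateral switch."""
    return all(best_reply(profile, i, capacity) == profile[i]
               for i in range(len(profile)))


def run(capacity, n_users, memory_len, max_steps=100_000, seed=0):
    """Simulate the dynamics; returns (profile, converged, steps)."""
    rng = random.Random(seed)
    profile = [rng.randrange(len(capacity)) for _ in range(n_users)]
    memory = [deque(maxlen=memory_len) for _ in range(n_users)]
    for step in range(max_steps):
        # All users compute best replies against the same current profile.
        for i in range(n_users):
            memory[i].appendleft(best_reply(profile, i, capacity))
        # Each user samples its next association uniformly from its memory.
        profile = [rng.choice(list(memory[i])) for i in range(n_users)]
        settled = all(set(memory[i]) == {profile[i]} for i in range(n_users))
        if settled and is_nash(profile, capacity):
            return profile, True, step + 1  # absorbing state reached
    return profile, False, max_steps


if __name__ == "__main__":
    print(run([2.0, 1.0], n_users=3, memory_len=3, seed=1))
```

The stopping rule mirrors the absorbing event in the proof: once the profile is a NE and every memory holds only the current association, no later sample can differ. The example call uses `memory_len >= n_users`, in line with the requirement $M \ge N$.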
Fig. 1. Illustration of the simulation setting with N = 10, W = 6, D = 0.3.

Fig. 2. Three realizations of throughput. K = 512, N = 20, W = 8.
Fig. 3. Averaged convergence time v.s. number of users. Each point in this figure is averaged over 200 random networks, and W = 8, K = 512, D = 0.4.

Fig. 4. Averaged convergence time v.s. number of BSs with and w/o costs. Each point in this figure is averaged over 200 random networks, d = 1 Mbit/sec, N = m, K = 512, D = {0.2, 0.5, 0.8}.
Fig. 5. Averaged system throughput v.s. number of BSs by different algorithms and the maximum achievable throughput. Each point in this figure is averaged over 100 random networks. N = 10, K = 64, D = {0.4, 0.8}.

Fig. 6. Averaged system throughput v.s. number of BSs by different algorithms. N = 30, K = 512, D = {0.2, 0.5, 0.8}. Each point in this figure is averaged over 200 random networks.

Fig. 7. Empirical CDF of the per-BS rate. Each curve in this figure is composed of the rates of the BSs over 100 random networks. W = 8, N = 30, K = 512, D = {0.2, 0.8}.
Fig. 8. Averaged throughput ratio (CAPA over nearest neighbor) v.s. distribution factor D. N = 30, K = 512, W = {2, 4, 8, 10, 12}. Each point in this figure is averaged over 200 random networks.

Fig. 9. Comparison of the empirical CDF of the users' rates in the multicell cellular networks. = 40. Each curve in this figure is the CDF of the users' rates in 100 generations of the network.
Fig. 10. Evaluation of the performance of the proposed algorithm in the presence of channel estimation error. Each point in this figure is averaged over 200 random networks.

Fig. 11. Construction of the network for clause Cq X2 V Xj, for CAPA resource allocation strategy.